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(57) Abstract: This invention provides 
methods for producing plants with 
£J £2 Z£ i£ ?£ ^7! r^£j£ifi! increased seed and biomass production. 

SSgggggg SSSS ^fca,^ invention pro^s 
' r 1 ^» ****** ***** ******* methods for producing plants which 

have increased yields for a number of 
plant traits, including seed number, seed 
weight, the number of seed heads, flag 
leaf weight and total plant weight. This 
invention also provides methods for 
improving the Harvest Index of a plant. 
In a preferred embodiment, the methods 
comprise introducing into a plant a 
nucleic acid wherein the nucleic acid is 
selected from the group consisting of a 
nucleic acid comprising SEQ ID NO: 3, 
a nucleic acid which hybridizes with SEQ 

ID NO: 3 under high stringency conditions and encodes a polypeptide that retains biological activity of SH2-REV6-HS, a fragment 
of SEQ ID NO: 3 encoding a peptide that retains biological activity of SH2-REV6-HS, a nucleic acid encoding a polypeptide 
comprising SEQ ID NO: 4, or a fragment thereof that retains biological activity of SH2-REV6-HS, and a nucleic acid encoding an 
SH2HS or an SH2RTS polypeptide. The present invention also concerns plants obtained by the methods provided hereia 
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INSCRIPTION 

t F A Msr,F.NTC PLANTS WITH INCREASED SHED YIELD, 
BIO-MASS w waevkst INDEX 

FiplH nf the Invention 
This invention relates to improving plant production, both plant seed production 
and plant biomass production. More specifically, this invention relates to transgenic 
plants which have increased seed production and increased biomass production when 
compared to non-transgenic plants of the same genetic background. Even more 
specifically, this invention relates to plants which are transgenic for Sh2-Rev6-HS and to 
methods for producing such plants. 

Background of the Invention 
ADP glucose pyrophosphorylase (AGP) is one of the primary enzymes involved 
in the biosynthesis of starch and glycogen in organisms such as plants, algae, fungi and 
bacteria, particularly plants. AGP catalyzes the following reaction: 

a-glucose- 1 -P+ATP ADP-glucose+PP! . 

ADP-glucose, the product of the above reaction, is the major donor of glucose in the 
biosynthesis of starch in plants and in the biosynthesis of glycogen by bacteria. 

AGP is widely distributed throughout the plant kingdom. It is present in 
monocots such as wheat, rice, barley, and maize, as well as dicots such as spinach, potato, 
and pea. It is also found in some starch producing bacteria, such as E. coli. Plant AGP 
exists as a tetramer (210 to 240 kDa) composed of two small sub-units (50 to 55 kDa) 
and two large sub-units (51 to 60 kDa) in contrast to bacterial AGP which appears to 
consist of four units of equal size. AGP has also been shown to be produced in 
cyanobacteria and in algae, where its tetrameric structure is similar to that in plants, i.e. 
two large and two small sub-units, rather than the homotetrameric structure found in 
ordinary bacteria. 
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Because of the commercial importance of starch, primarily as a foodstuff but also 
as an important industrial chemical, much work has been done to isolate and characterize 
the nucleic acid encoding AGP. Plant AGP consists of two different protein subunits. 
In maize endosperm, AGP is encoded by the Shrunken-2 (Sh2) and Brittle-2 (Bt2) genes 
5 (Bhave et al, 1990 and Bae et al, 1990). Sh2 encodes the large subunit having a 

predicted molecular weight of 57,179 Da, while Bt2 encodes the small subunit having a 
molecular weight of 52,224 Da. The isolation of nucleic acids encoding AGPs from 
various other plants has also been reported: the small subunit cDNA (Anderson et al, 
1989) and the genomic DNA (Anderson et al, 1991) from rice; the small and large 

10 subunit cDNAs from spinach leaf (Morell et al., 1988); and the small and large subunit 
cDNAs from potato tuber (Muller-Rober et al, 1990; and Nakata et al, 1991). 

Moreover, work has been done to alter AGP expression in plants in order to 
regulate starch synthesis. EP 455,316 provides a plasmid that comprises a DNA 
encoding AGP placed in an inverted orientation, which results in the transcription of the 

15 anti-sense mRNA in a host plant. The patent shows that transgenic potatoes comprising 

the plasmid has reduced AGP activity and reduced starch concentration compared to non- 
transformed plants. U.S. Patent No. 5,773,693 discloses a method of increasing sucrose 
content of pea plant by suppressing or reducing the expression of either or both subunits 
of AGP. The method comprises transforming a pea plant with a plasmid comprising 

20 nucleic acid encoding the Sh2 subunit or the Bt2 or both subunits in antisense orientation 
to the promoter and the terminator. 

In contrast, U.S. Patent No. 5,977,437 teaches a method of increasing the rate and 
/or yield of starch production in a plant comprising introducing into a plant, a nucleic acid 
encoding barley endosperm AGP operably linked to aplastid transit peptide. EP 634,491 

25 discloses a method of decreasing oil content in seed by increasing the amount of starch 

comprising transforming a plant cell with a nucleic acid comprising a promoter, and a 
. _ DNA encoding affusion protein comprising an amino terminal plastid transit peptide, an 
AGP enzyme, and. a 3.' non-translated transcription. termination sequence, obtaining, 
transformed plant cells, and regenerating transformed plants from the transformed plant 

30 cells. Finally, U.S. Patent No. 5,792,290 discloses the nucleic acid encoding wheat AGP 
and teaches inserting extra copies of the AGP gene into a plant genome by transformation 
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to enhance starch production and inserting the complement of the mRUA encoding the 
endogenous AGP to reduce starch production. 

The matze endosperm is the site of most starch deposition during kernel 
development. Sh2 and Bt2 maize endosperm mutants have greatly reduced starch levels 
5 corresponding to deficient levels of AGP activity. Mutations of either gene have been 
shown to reduce AGP activity by about 95% (Tsai et al, 1966; Dickinson et al, 1969). 
Lack of AGP and a decrease in starch levels compared to that of the wild-type endosperm 
result in shrunken, brittle, and/or collapsed kernels at seed maturity. Furthermore, it has 
been observed that enzymatic activities increase with the dosage of functional wild-type 
10 Sh2 and Bt2 alleles, whereas mutant enzymes have altered kinetic properties. 

AGP is the rate limiting step in starch biosynthesis in plants. Stark et al placed 
a mutant form of £ coli AGP in potato tuber and obtained a 35% increase in starch 
content (Stark et al, 1992). AGP is an allosteric enzyme, i.e. its activity is regulated 
through the binding of an effector to an allosteric site. In plants, the positive effector of 
15 AGP is 3-phosphoglycerate (3-PGA), and the negative effector is phosphate (D.ckmson 
et al, 1969). Inhibition of AGP by phosphate is likely the largest limitation on starch 
biosynthesis in plants (Giroux et al, 1996). 

Giroux et al (1996; U.S. Patent Nos. 5,872,216 and 5,589,618, each of which 
is hereby incorporated by reference in their entireties) used in vivo, site-specific 
20 mutagenesis to create short insertion mutations in a region of the gene known to be 
involved in the allosteric regulation of AGP. Single mutations of the Sh2 gene containing 
an insertion of an additional tyrosine or serine residue reduced total AGP activity and the 
amount of SH2 protein. A specific revertant containing an additional tyrosine residue 
and an additional serine residues increased seed weight 1 1-18%. This later revertant were 
25 named "Sh2-mlRev<?> (this gene is designated as «Sh2-RevS> herein). Giroux et al 
(1996) also found that the increase in seed weight of the Sh2-mlRev6 was not solely 
- attributable to an increase in starch content, although there was an increase in the absolute 
starch content in the variant expressing Sh2-mlRev6. Giroux et al. (1996) suggested that 
the enhanced starch synthesis caused by Rev6 creates a stronger sink within the seed 
30 leading to increased synthesis of other seed components. Mutations in AGP conferring 
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increased heat stability to a plant expressing the mutant AGP are disclosed in U.S. Patent 
No. 6,069,300 and published PCT application WO 99/58698. 

Modulation of the sink strength of a plant is one of the methods to increase 
harvest yield. The leaves and other green tissue active in photosynthesis are commonly 

5 referred to as the "source", and those parts in which storage occurs are referred to as the 
"sink". In cereals such as maize, rice, and wheat, the primary sink is the endosperm, and 
individual seed weight is the primary determinant of the yield of corn (Duvick et aL, 
1992). As evidenced by Giroux et aL (1996), rendering the maize endosperm AGP 
insensitive to phosphate inhibition, increases individual seed weight without dramatically 

10 affecting starch content (U.S. Patent Nos. 5,650,557 and 5,872,2 1 6). 

Over the years, the desire for high biological yield has aroused an interest in 
manipulating plant structure in order to obtain plants where the economically useful part 
forms are as large a proportion of the plant as is consistent with acceptable plant vigor 
and health. Attempts to increase yield by altering the relative contribution of the different 

15 components of grain or kernel yield, such as ears or heads per plant, grains per head or 

kernels per ear, grain size or kernel size, etc., have proven unsuccessful because increases 
in one component tend to be accompanied by reductions in another (Wilson, D. (1981) 
Plant Breeding H K. Frey Edited, Iowa, Iowa State University Press, page 255). 
However, yield increases due to an increase in the proportion of grain relative to 

20 vegetative parts have been common in the cereal crops (Wilson, D. (1 98 1) Plant Breeding 
IL K. Frey Edited, Iowa, Iowa State University Press, page 255). 

Langer and Hill (Langer, R. H. M. and Hill, G. D. (1991) Agricultural Plants. 
Second Edition. Cambridge, Cambridge University Press, page 341) state that higher 
yields can be achieved by improving the Harvest Index (HI), since HI links biological 

25 yield (Y WoI ) and economic yield (Y ccon ) in the following manner : 

Y bio j x HI — Y^j, 

■ ' • — it is pointed out that treatments affecting HI will also affect Y bio , but not necessarily to the 
" ' T * " same extent or in the same direction. For instance, in cereals it is possible to increase 

biological yields by applying nitrogen at high population densities in the presence of 
30 adequate water. The expected result is heavy vegetative growth, but reduced light 

transmission into the canopy, poor grain set and development will lead to a low Harvest 
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Index. In contrast, short-strawed cereals are characterized by greater Harvest Index. 
Short, erect cultivars of rice yielding 4-5 t ha" 1 have been shown to have a Harvest Index 
of about 0.53 to 0.56, compared with 0.39 to 0.42 for tall, leafy cultivars with a grain 
yield of about 2.4 t ha 1 (Langer, R. H. M. and Hill, G. D. (1991) Agriculture! Plants. 
5 Second Edition, Cambridge, Cambridge University Press, page 34 1 ). Likewise, in wheat, 

the dwarf and semidwarf cultivars emanating from the Mexican plant breeding program 
have a higher Harvest Index. However, short plants may also produce little grain. Thus, 
it is necessary to assess both the biological yield and Harvest Index in plant breeding 
programs. 

10 The present invention provides a method of increasing the seed production and 

the biomass production of plants. More specifically, this invention provides transgenic 
plants which have increased total seed number, increased individual seed weight, 
increased total seed weight per plant, as well as increased above-ground plant biomass 
and an improved Harvest Index when compared to non-transgenic plants of the same 

1 5 genetic background. The production of plants with increases in all of these parameters 

as the result of a transgene is quite unexpected in view of normal source/sink 
relationships in plants. 

Brief Summa ry of the Invention 
This invention provides methods of producing plants which have improved plant 
production, both plant seed production and plant biomass production. This invention also 
provides the plants produced by the disclosed methods, wherein the plants are 
monocotyledonous plants and dicotyledonous plants. 

More specifically, this invention provides methods for increasing the number of 
seeds produced by plants, increasing the biomass produced by plants, or increasing the 
Harvest Index of plants by introducing into such plants a nucleic acid operably linked to 
a promoter, wherein the nucleic acid is that of SH2-RE V6-HS (SEQ ID NO: 3), a nucleic 
acid which hybridizes with SH2-REV6-HS under high stringency conditions and encodes 
a polypeptide that retains biological activity of the protein SH2-REV6-HS (SEQ ID NO: 
4), a fragment of SH2-REV6-HS encoding a peptide that retains biological activity of 
SH2-REV6-HS, a nucleic acid encoding a polypeptide comprising SEQ ID NO:4, or a 



20 



25 



30 



WO 01/64928 



PCT/USO 1/06622 



6 

fragment thereof that retains biological activity of SH2-REV6-HS, or a nucleic acid 
encoding an SH2HS or an SH2RTS polypeptide. Preferably, the SH2HS polypeptide is 
the SH2HS33 polypeptide. The methods further include growing the plants produced by 
such methods. The present invention also includes the plants produced by such methods. 

The methods of the present invention are applicable to monocotyledonous plants, 
such as rice, wheat, barley, oats, sorghum, and millet, and dicotyledonous plants, such 
as peas, alfalfa, birdsfoot trefoil, chickpea, chicory, clover, kale, lentil, prairie grass, 
small burnet, soybean, and lettuce. 

This invention also provides methods of increasing the flag leaf weight of 
monocotyledonous plants by introducing into such plants a nucleic acid operably linked 
to a promoter, wherein the nucleic acid is that of SH2-REV6-HS (SEQ ID NO: 3), a 
nucleic acid which hybridizes with SH2-REV6-HS under high stringency conditions and 
encodes a polypeptide that retains biological activity of the protein SH2-REV6-HS (SEQ 
ID NO: 4), a fragment of SH2-REV6-HS encoding a peptide that retains biological 
activity of SH2-REV6-HS, a nucleic acid encoding a polypeptide comprising SEQ ID 
NO:4, or a fragment thereof that retains biological activity of SH2-REV6-HS, or a 
nucleic acid encoding an SH2HS or an SH2RTS polypeptide. Preferably, the SH2HS 
polypeptide is the SH2HS33 polypeptide. The methods further include growing the 
plants produced by such methods. The present invention also includes the plants 
produced by such methods. 

The invention also provides methods of increasing the number of seed heads 
produced by monocotyledonous plants by introducing into such plants a nucleic acid 
operably linked to a promoter, wherein the nucleic acid is that of SH2-REV6-HS (SEQ 
ID NO: 3), a nucleic acid which hybridizes with SH2-REV6-HS under high stringency 
conditions and encodes a polypeptide that retains biological activity of the protein SH2- 
REV6-HS (SEQ ID NO: 4), a fragment of SH2-REV6-HS encoding a peptide that retains 
biological-activity of SH2-REV6-HS,- a nucleic -acid encoding a polypeptide comprising 
SEQ ID NO:4, or a fragment thereof that retains biological activity of SH2-REV6-HS, 
or a nucleic acid encoding an SH2HS or an SH2RTS polypeptide. Preferably, the SH2HS 
polypeptide is the SH2HS33 polypeptide. The methods further include growing the 



PCTAJS01/06622 

WO 01/64928 



plants produced by such methods. The present invention also includes the plants 

produced by such methods. 

The invention also provides methods of increasing two or more traits of 
dicotyledonous plants by introducing into such plants a nucleic acid operably linked to 
5 a promoter, wherein the nucleic acid is that of SH2-REV6-HS (SEQ ID NO: 3), a nucleic 
acid which hybridizes with SH2-REV6-HS under high stringency conditions and encodes 
a polypeptide that retains biological activity of the protein SH2-REV6-HS (SEQ ID NO: 
4), a fragment of SH2-REV6-HS encoding a peptide that retains biological activity of 
SH2-REV6-HS, a nucleic acid encoding a polypeptide comprising SEQ ID NO:4, or a 
10 fragment thereof that retains biological activity of SH2-REV6-HS, or a nucleic acid 
encoding an SH2HS or an SH2RTS polypeptide. Preferably, the SH2HS polypeptide is 
the SH2HS33 polypeptide. The methods further include growing the plants produced by 
such methods. The present invention also includes the plants produced by such methods. 
The invention further provides methods of increasing the yield of two or more 
15 traits of monocotyledonous plants by introducing into such plants a nucleic acid operably 
linked to a promoter, wherein the nucleic acid is that of SH2-REV6-HS (SEQ ID NO: 3), 
a nucleic acid which hybridizes with SH2-REV6-HS under high stringency conditions and 
encodes a polypeptide that retains biological activity of the protein SH2-REV6-HS (SEQ 
ID NO: 4), a fragment of SH2-REV6-HS encoding a peptide that retains biological 
20 activity of SH2-REV6-HS, a nucleic acid encoding a polypeptide comprising SEQ ID 
NO:4, or a fragment thereof that retains biological activity of SH2-REV6-HS, or a 
nucleic acid encoding an SH2HS or an SH2RTS polypeptide. Preferably, the SH2HS 
polypeptide is the SH2HS33 polypeptide. The methods further include growing the 
plants produced by such methods. The present invention also includes the plants 

25 produced by such methods. 

The present invention further includes crossing the plants obtained by the above- 
methods to one- or more other plants and harvesting and growing the seed which is 
produced as a result of making the crosses. 

The present invention further includes harvesting seed which produced by selfing 
30 the plants obtained by the above methods and growing the harvested seed. 
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The present invention provides plants which include a nucleic acid encoding the 
amino acid sequence of SH2-REV6-HS (SEQ ID NO: 4), or a fragment thereof that 
retains biological activity of SH2-REV6-HS. 

The present invention provides plants which include a nucleic acid encoding the 
5 amino acid sequence of an SH2HS or an SH2RTS protein, or a fragment thereof that 
retains biological activity of an SH2HS or an SH2RTS protein. In a preferred 
embodiment, the SH2HS polypeptide has the amino acid sequence of SH2HS33. 

Brief Description of the Drawing 
10 Figure 1 shows a Northern blot analysis of Sh2-Rev6-HS transgenic rice lines. 

Detailed Description of the Invention 

I. Definitions 

As used herein, the term "AGP" means ADP glucose pyrophosphorylase. 
1 5 As used herein, the term "allele" means any of several alternative forms of a gene. 

As used herein, the term "biological activity" means any functional activity of an 
SH2 mutant polypeptide of the invention, such as the SH2-REV6, SH2HS33, and SH2- 
REV6-HS polypeptides. The functional activity of the subject polypeptides includes but, 
is not limited to, increasing total seed number, increasing individual seed weight, 
20 increasing total seed weight per plant, increasing above-ground plant biomass, increasing 
Harvest Index, and phosphate insensitivity, and increased heat stability. 

As used herein, the term "Bt2" means the Brittle-2 gene encoding the small 
subunit of AGP. As used herein, the term "6/2" means a mutant form of the Bt2 gene, 
which renders the kernels of corn brittle in texture upon drying. 
25 As used herein, the term "cereal" means, depending on the context, either: 1) a 

grass plant, such as corn, or 2) the grain of a grass plant. 
- - . As-used herein, the term "crop plant" means any plant grown- for any commercial 

purpose, including, but not limited to the following purposes: seed production, grain 
production, hay production, ornamental use, fruit production, berry production, vegetable 
30 production, oil production, protein production, forage production, silage, animal grazing, 

golf courses, lawns, flower production, landscaping, erosion control, green manure, 
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improving soil tilth/health, producing pharmaceutical products/drugs, producing food 
additives, smoking products, pulp production and wood production. Particular crop 
plants of interest to the present invention include, but are not limited to, wheat, rice, 
maize, barley, rye, sugar beets, potatoes, sweet potatoes, soybeans, cotton, tomatoes, 

5 canola and tobacco. 

As used herein, the term "cross pollination" or "cross-breeding" means the pollen 
of one flower on one plant is applied (artificially or naturally) to the ovule (stigma) of a 

flower on another plant. 

As used herein, the term "cultivar" means a variety, strain or race of plant which 
1 0 has been produced by horticultural or agronomic techniques and is not normally found 
in wild populations. 

As used herein, the terms "Dicotyledoneae", "dicotyledonous", "dicotyledon" or 
"dicot" are synonymous and mean any of various flowering plants having two embryonic 
seed leaves or cotyledons that usually appear at germination. Examples include, but are 
15 not limited to, tobacco, soybeans, potato, sweet potato, radish, cabbage, rape and apple 
trees. 

As used herein, the term "flag leaf refers to the uppermost leaf on a fruiting 
(fertile) culm; the leaf immediately below the inflorescence or seed head. 

As used herein, the term "genotype" means the genetic makeup of an individual 
20 cell, cell culture, plant, or group of plants. 

As used herein, the term "grain" means, depending on its context, either: 1) the 
cereal grasses considered as a group, or 2) the fruit of one or more of the cereal grasses. 

As used herein, the terms "grass" or "grasses" mean a plant belonging to the 
family Poaceae. 

25 As used herein, the term "Harvest Index" is the proportion of total plant mass 

harvested. It is the ratio of weight of grain/ (weight of grain plus weight of plant). This 
- is identical to HI as discussed elsewhere herein (see, also, Langer and Hill, 1991), 
wherein HI links biological yield and economic yield, and HI is the ratio of economic 
yield/biological yield. The economic yield (Y econ ) is the weight of grain, while the 

30 biological yield (Y Wol ) is the weight of grain plus weight of plant. The weight of grain is 
synonymous with the total seed weight. 
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As used herein, the term "heterozygote" means a diploid or polyploid individual 
cell or plant having different alleles (forms of a given gene) at least at one locus. 

As used herein, the term "heterozygous" means the presence of different alleles 
(forms of a given gene) at a particular gene locus. 
5 As used herein, the term "homozygote" means an individual cell or plant having 

the same alleles at one or more loci. 

As used herein, the term "homozygous" means the presence of identical alleles 
at one or more loci in homologous chromosomal segments. 

As used herein, the term "hybrid" means any individual plant resulting from a 
1 0 cross between parents that differ in one or more genes. 

As used herein, the term "inbred" or "inbred line" means a relatively true- 
breeding strain. 

As used herein, a nucleic acid molecule is said to be "isolated" when the nucleic 
acid molecule is substantially separated from contaminant nucleic acid encoding other 
1 5 polypeptides from the source of nucleic acid. 

As used herein, the term "line", when directed to a type of plant, means self- or 
cross-fertilizing plants and single-line facultative apomicts, having largely the same 
genetic background, that are similar in essential and distinctive characteristics. 

As used herein, the term "locus" (plural: "loci") means any site that has been 
20 defined genetically. A locus may be a gene, or part of a gene, or a DNA sequence that 

has some regulatory role, and may be occupied by different sequences. 

As used herein, the term "mass selection" means a form of selection in which 
individual plants are selected and the next generation propagated from the aggregate of 
their seeds. 

25 As used herein, the terms "Monocotyledoneae", "monocotyledonous", 

"monocotyledon" or "monocot" are synonymous and mean any of various flowering 
' plants having a singlecbfyleddn in'the seed.' Example's "of mdnocofe include, but are' not 
limited to, rice, wheat, barley; maize and lilies. 

As used herein, the term "Northern Blot" refers to the analysis of RNA by 
30 electrophoresis of RNA on agarose gels to fractionate the RNA according to size 
followed by transfer of the RNA from the gel to a solid support, such as nitrocellulose 
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or a nylon membrane. The immobilized RNA is then probed with a labeled probe to 
detect RNA species complementary to the probe used. Northern blots are a standard tool 
of molecular biologists (Sambrook et ai, Molecular Cloning: A Laboratory Manual , 2nd 
edition, Cold Spring Harbor Laboratory Press, 1985). 
5 As used herein, the term "open pollination" means a plant population that is freely 

exposed to some gene flow, as opposed to a closed one in which there is an effective 

barrier to gene flow. 

As used herein, the terms "open-pollinated population" or "open-pollinated 
variety" mean plants normally capable of at least some cross-fertilization, selected to a 
1 0 standard, that may show variation but that also have one or more genotypic or phenotypic 
characteristics by which the population or the variety can be differentiated from others. 
A hybrid which has no barriers to cross-pollination is an open-pollinated population or 
an open-pollinated variety. 

As used herein, the term "ovule" means the female gametophyte, whereas the 
1 5 term "pollen" means the male gametophyte. 

As used herein, the term "phenotype" means the observable characters of an 
individual cell, cell culture, plant, or group of plants which results from the interaction 
between that individual's genetic makeup (i.e., genotype) and the environment. 

As used herein, the term "progeny" means the descendants of a particular plant 
20 (self-cross) or pair of plants (crossed or backcrossed). The descendants can be of the F ls 
the F 2 , or any subsequent generation. Typically, the parents are the pollen donor and the 
ovule donor which are crossed to make the progeny plant of this invention. Parents also 
refer to F, parents of a hybrid plants of this invention (the F 2 plants). Finally, parents 
refer to a recurrent parent which is backcrossed to hybrid plants of this invention to 
25 produce another hybrid plant of this invention. 

As used herein, the term "Polymerase Chain Reaction" is synonymous with 
- A "PGR" and refers to techniques -in which cycles of denaturation, annealing with 
oligonucleotide primers, and extension with DN A polymerase, are used to amplify the 
number of copies of a target DNA sequence. 
30 As used herein, the term "revertant" refers to a mutated Sh2 gene {i.e., mutated 

relative to the wild-type Sh2 gene) wherein the mutant results in a wild-type kernel 
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phenotype (i.e., a plump seed, not a shrunken seed like the phenotype displayed by the 
mutant sh 2sh2 genotype). A revertant genotype would have more AGP activity than a 
sh2sh2 genotype and may have either more or less AGP activity than a wild-type Sh2 
genotype. Typically, the revertants have a wild-type seed phenotype with at least around 
5 30% AGP activity compared to that of a normal (i.e., non-revertant), wild-type. In some 

instances, the term "revertant" may refer to the cell or plant which contains the mutated 
Sh2 gene. 

As used herein, the term "rice" means any Oryza species, including, but not 
limited to, O. sativa, O. glaberrima, O. perennis, O. nivara, and O. breviligulata. Thus, 
10 as used herein, the term "rice" means any type of rice including, but is not limited to, any 
cultivated rice, any wild rice, any rice species, any intra- and inter-species rice crosses, 
all rice varieties, all rice genotypes and all rice cultivars. 

As used herein, the term "self pollinated" or "self-pollination" means the pollen 
of one flower on one plant is applied (artificially or naturally) to the ovule (stigma) of the 
15 same or a different flower on the same plant. 

As used herein, the term "Sh2" refers to the Shrunken-2 gene encoding the large 
subunit of AGP. Sometimes, the term may refer to the cell or plant which contains the 
Sh2 genotype. 

As used herein the term "s/z2" means a mutant form of the Sh2 gene, which 
20 renders the kernels of corn shrunken or collapsed upon drying. Sometimes, the term may 
refer to the cell or plant which contains the sh2 genotype. 

As used herein, the term "Sh2hs" refers to mutants of the Shrunken-2 gene which 
encode heat-stable variants of maize endosperm AGP. Sometimes, the term may refer 
to the cell or plant which contains the Sh2hs genotype. The term "SH2HS" refers to 
25 polypeptides encoded by Sh2hs. A preferred embodiment contemplated by the subject 
invention is the Sh2hs33 gene which encodes the polypeptide referred to herein as 
SH2HS3-3: The* SH2HS33 polypeptide contains" the HS33"mutation disclosed in U.S. • 
• Patent No. 6,069,300 and published PCT application WO 99/58698. Other embodiments 
contemplated for use in the methods of the present invention include, but are not limited 
30 . to, Sh2hsl3, Sh2hsl4, Sh2hsl6, Sh2hs39, Sh2hs40, and Sh2hs47 polynucleotides which 
encode the polypeptides referred to herein as SH2HS13, SH2HS14, SH2HS16, 
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SH2HS39, SH2HS40, and SH2HS47, respectively. The SH2HS13, SH2HS14, 
SH2HS16, SH2HS39, SH2HS40, and SH2HS47 polypeptides contain the HS13, HS14, 
HS16, HS39, HS40, and HS47 mutations, respectively, that are disclosed in U.S. Patent 
No. 6,069,300 and published PCT application WO 99/58698. 

5 As used herein, the term "Sh2rts n refers to temperature sensitive revertant mutants 

of the Shrunken-2 gene which encode heat-stable variants of maize endosperm AGP. 
Sometimes, the term may refer to the cell or plant which contains the Sh2rts genotype. 
The term "SH2RTS" refers to polypeptides encoded by Sh2rts. Examples of 
embodiments contemplated for use in the methods of the present invention include, but 

10 are not limited to, Sh2rts48-2, and Sh2rts60-1 polynucleotides which encode the 
polypeptides referred to herein as SH2RTS48-2 and SH2RTS60-1, respectively. The 
SH2RTS48-2 and the SH2RTS60-2 polypeptides contain the RTS48-2 and RTS60-2 
mutations, respectively, disclosed in U.S. Patent No. 6,069,300 and published PCT 
application WO 99/58698. 

1 5 As used herein, the term u Sh2hs3T refers to a single point mutation in Sh2 which 

increases the stability of maize endosperm AGP through enhanced subunit interactions. 
The mutation is a change from His-to-Tyr at amino acid position 333 (Greene and 
Hannah, 1998). Sometimes, the term may refer to the cell or plant which contains the 
Sh2hs33 genotype. 

20 As used herein, the term "Sh2-Rev6" is synonymous with "Sh2-ml-Rev6" and 

refers to variants of the Shrunken-2 gene. The polypeptide product of the Sh2-Rev6 gene 
contains two additional amino acids, tyrosine and serine, inserted between amino acids 
494 and 495 of the wild-type Sh2 polypeptide. Maize endosperm encoded by Sh2-Rev6 
expresses an AGP that is insensitive to phosphate and results in an increased seed weight 

25 in maize (Giroux et aL, 1 996; U.S. Patent Nos. 5,650,557 and 5,872,2 1 6). Sometimes, 
the term may refer to the cell or plant which contains the Sh2-Rev6 genotype. 

^ . ^ used herein, the term '?$h2-.Rev6-Htr is synonymous with *?Sh2-mJRe»6rHSr. 
and refers to a heat stable variant of the Sh2-Rev6 gene, wherein His is replaced by Tyr 
at position 333. Sometimes, the term may refer to the cell or plant which contains the 

30 Sh2-Rev6-HS genotype. The HS33 mutation of maize AGP, along with other mutations 
conferring heat stability, are disclosed in U.S. Patent No. 6,069,300 and published PCT 
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application WO 99/58698 and are specifically contemplated for use in the methods of the 
present invention. 

As used herein, the term "Sh2hs33" refers to a specific heat stable genetic variant 
of Sh2. The variant contains a His to Tyr mutation at position 333 of the wild-type maize 
5 Sh2 gene (Greene and Hannah, 1998). The mutation renders the maize endosperm AGP 
activity heat-stable. Sometimes, the term may refer to the cell or plant which contains 
the Sh2hs33 genotype. 

As used herein, the phrase "shrunken and brittle" describes the morphology of 
specific types of kernels of a corn. In a brittle and shrunken kernel, the endosperm is 
10 greatly collapsed. The endosperm before drying is like a fluid-filled sac that develops 
with little starch. On drying, the kernel shrinks and collapses into an angular structure 
with marked concavities and brittle texture (Coe et ai, 1988). 

As used herein, the term "synthetic" means a set of progenies derived by 
intercrossing a specific set of clones or seed-propagated lines. A synthetic may contain 
15 mixtures of seed resulting from cross-, self-, and sib- fertilization. 

As used herein, the terms *T„ T 2 , T 3 , . . ." refer to the succeeding generations of 
cells or plants tracing back to a particular tissue culture-derived or transformed cell line 
designated as T 0 , or the parental generation. As regards plants, the plants produced 
directly from the transformed cells are referred to as the T 0 generation. The seeds 
20 produced by selfing the T 0 generation plants are referred to as the Tj seeds. When the T, 
seeds are germinated, the resulting plants are referred to as the T, generation or the T, 
progeny. Seeds produced by the T, generation are referred to as the T 2 seeds. 

As used herein, in grasses, the term "tiller" means a lateral shoot arising at ground 
level. Each of the tillers that were counted in the present studies had a head on the stem 
25 of the shoot. 

As used herein, the term "transformation" means the transfer of nucleic acid (i.e., 
■ "■ '■ w ' "a "nucleotide polymer) intbaxell. As used herein, the -term "getietic r transfonriati6n"- 
' means the transfer aind incorporation of DN A, especially recombinant DNA/into a cell. 

As used herein; the term "transgenic" means cells, cell cultures, plants, and 
30 progeny of plants which have received a foreign or modified nucleic acid sequence by 
one of the various methods of transformation, wherein the foreign or modified nucleic 
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acid sequence is from the same or different species than the species of the plant receiving 
the foreign or modified nucleic acid sequence. The foreign or modified nucleic acid used 
to produce such transgenic cells, cell cultures, plants and progeny of such plants includes 
genes, gene fragments as well as nucleic acid sequences which code for a product which 

5 has at least one biological activity or function. As used herein, the terms "transgenic 

plant" and "transformed plant" are synonymous, as are the terms "transgenic line" and 
"transformed line". As used herein, the phrases "corresponding non-transgenic plant" 
and "corresponding non-transgenic line" refer to the cells, cell cultures, plants and 
progeny of plants which did not receive the foreign or modified gene which the 

10 "transgenic" cells, cell cultures, plants and progeny of plants which did receive the 
foreign or modified gene. 

As used herein, the term "variety" means a subdivision of a species, consisting 
of a group of individuals within the species which are distinct in form or function from 
other similar arrays of individuals. 

15 As used herein, the term "wheat" means any Triticum species, including, but not 

limited to, T. aestivum, T, monococcum, T. tauschii and T. turgidum. Thus, as used 
herein, the term "wheat" means any type of wheat including, but is not limited to, any 
cultivated wheat, any wild wheat, any wheat species, any intra- and inter-species wheat 
crosses, all wheat varieties, all wheat genotypes and all wheat cultivars. Cultivated 

20 wheats include, but are not limited to, einkorn, durum and common wheats. 

As used herein, the term "wild-type" refers to the naturally occurring allele of a 
particular gene. Sometimes the terms refers to the cell or plant containing the wild-type 
alleles of the particular gene. 

25 II. Nucleic Acids Encoding Sh2-Rev6 and Sh2-Rev6-HS 

Giroux et aL (1996) isolated and sequenced genomic DNA and cDNA encoding 
Sh2-Rev6. The nucleotide sequence of Sh2-Rev6 is provided in SEQ ID NO: 1 and the 
amino acid sequence of SH2-REV6 is provided in SEQ ID NO: 2 (see, also, U.S. Patent 
No. 5,650,557 and U.S. Patent No. 5,872,216). Corn seeds that contain at least one 
30 functional Sh2-Rev6 allele have been deposited with the American Type Culture 

Collection (ATCC), 12301 Parklawn Drive, Rockville, MD, 20852 USA, on May 16, 
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1999 and assigned accession number ATCC 97624 (see, column 5 of U.S. Patent Nos. 

5,650,557 and 5,872,216). 

Sh2-Rev6 was further modified by altering His to Tyr at amino acid position 333 

to produce the variant Sh2-Rev6-HS (Greene and Hannah et a/., 1998; U.S. Patent No, 
5 6,069,300). The nucleotide sequence of Sh2-Rev6-HS is provided in SEQ ID NO: 3 and 

the amino acid sequence of SH2-REV6-HS is provided in SEQ ID NO: 4. 

As used herein, Sh2-Rev6, Sh2hs33 y and Sh2-Rev6-HS include the specifically 

identified and characterized variants herein described as well as allelic variants, 

conservative substitution variants and homologues that can be isolated/generated and 
10 characterized without undue experimentation following methods well known to one 

skilled in the art. 

Homology or identity at the amino acid or nucleotide level is determined by 
BLAST (Basic Local Alignment Search Tool) analysis using the algorithm employed by 
the programs blastp, blastn, blastx, tblastn and tblastx (Karlin et aL 9 1990, Proc. Natl. 

15 Acad. Sci. USA 87, 2264-2268 and Altschul, 1993, J, Mol. Evol.36, 290-300, fully 
incorporated by reference) which are tailored for sequence similarity searching. The 
approach used by the BLAST program is to first consider similar segments between a 
query sequence and a database sequence, then to evaluate the statistical significance of 
all matches that are identified and finally to summarize only those matches which satisfy 

20 a preselected threshold of significance. For a discussion of basic issues in similarity 
searching of sequence databases (see Altschul et aL, 1994, Nature Genetics 6, 1 19-129 
which is fully incorporated by reference). The search parameters for histogram, 
descriptions, alignments, expect (i.e., the statistical significance threshold for reporting 
matches against database sequences), cutoff, matrix and filter are at the default settings. 

25 The default scoring matrix used by blastp, blastx, tblastn, and tblastx is the 
BLOSUM62 matrix (Henikoff etai, 1992. Proc. Natl. Acad. Sci. USA 89, 10915-10919, 
' fully lncorporate^by fffererice). For blastn, the scoring imafnxTs set by the ratios bT^ 
(i.e., the rewiard score for a pair of malching residues) to N (ie., the penalty score for 
mismatching residues), wherein the default values for M and N are 5 and -4, respectively. 

30 : The terms "Sh2-Rev6 genes/ 5 "Sh2-Rev6-HS genes," and 4 \S7z2/tf 53 genes" include 

all allelic variants of the Sh2-Rev6 genes, Sh2hs33 genes, and Sh2-Rev6-HS genes 
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exemplified herein, wherein such allelic variants code for proteins which result in one or 
more of the same physiological characteristics as those of the proteins produced by the 
Sh2-Rev6, Sh2hs33, and Sh2-Rev6-HS genes disclosed herein. 

The Sh2-Rev6, Sh2hs33, and Sh2-Rev6-HS nucleic acid molecules or fragments 
thereof utilized in the present invention may also be synthesized using methods known 
in the art. It is also possible to produce the molecule by genetic engineering techniques, 
by constructing DNA using any accepted technique, cloning the DNA in an expression 
vehicle and transfecting the vehicle into a cell which will express the SH2-REV6, 
SH2HS33, and SH2-REV6-HS proteins. See, for example, the methods set forth in 
Sambrook et al, Molecular Cloning: A Laboratory Manual . 2nd edition, Cold Spring 
Harbor Laboratory Press, 1985. 

It is understood that all polynucleotides encoding all or a portion of the 
polypeptides of the present invention, such as the SH2-REV6, SH2HS33, and SH2- 
REV6-HS proteins, are also included herein, as long as they encode a polypeptide with 
one or more of the functional activities of the subject proteins as set forth herein. Thus, 
for example, any polynucleotide fragment having the activities of the SH2-REV6, 
SH2HS33, and SH2-REV6-HS proteins discussed herein are encompassed by the present 
invention. 

Polynucleotide sequences of the invention include DNA, cDN A, synthetic DNA 
and RNA sequences which encode polypeptides of the present invention, such as, for 
example, SH2-REV6, SH2HS33, and SH2-REV6-HS proteins. Such polynucleotides 
also include naturally occurring, synthetic and intentionally manipulated polynucleotides. 
For example, such polynucleotide sequences may include genomic DNA which may or 
may not include naturally occurring introns. Moreover, such genomic DNA may be 
obtained in association with promoter regions or poly A sequences. As another example, 
portions of the mRNA sequence may be altered due to alternate RNA splicing patterns 
or the use- of alternate promoters for RNA transcription. - As yet another example, Sh2- 
Rev6; Sh2hs33 9 and Sh2-Rev6-HS polynucleotides may be subjected to additional 
mutations using, for example, site-directed mutagenesis and DNA shuffling. 

The polynucleotides of the invention further include sequences that are degenerate 
as a result of the genetic code. The genetic code is said to be degenerate because more 
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than one nucleotide triplet can code for the same amino acid. There are 20 natural amino 
acids, most of which are specified by more than one codon. It will be appreciated by 
those skilled in the art that as a result of the degeneracy of the genetic code, a multitude 
of nucleotide sequences, some bearing minimal nucleotide sequence homology to the 
nucleotide sequences of the subject polynucleotides, such as Sh2-Rev6, Sh2hs33, and 
Sh2-Rev6-HS, may be utilized in the present invention. Therefore, all degenerate 
nucleotide sequences are included in the invention as long as the amino acid sequence of 
the subject polypeptides, for example, the SH2-REV6, SH2HS33, and SH2-REV6-HS 
polypeptides, encoded by the nucleotide sequence are functionally unchanged or 
substantially similar in function. The invention specifically contemplates each and every 
possible variation of peptide or nucleotide sequence that could be made by selecting 
combinations based on the possible amino acid and codon choices made in accordance 
with the standard triplet genetic code as applied to polynucleotide sequences of the 
invention, as exemplified by Sh2-Rev6, Sh2hs33, and Sh2-Rev6-HS, and all such 
variations are to be considered specifically disclosed herein. 

Also included in the invention are fragments (portions, segments) of the 
sequences disclosed herein which selectively hybridize to polynucleotides of the present 
invention, such as, for example, Sh2-Rev6, Sh2hs33 y and Sh2-Rev6-HS. Selective 
hybridization as used herein refers to hybridization under stringent conditions (See, for 
example, the techniques in Maniatis et al (1989) Molecular Cloning: A Laboratory 
Manual . Cold Spring Harbor Laboratory Press), which distinguishes related from 
unrelated nucleotide sequences. The active fragments of the invention, which are 
complementary to mRNA and the coding strand of DNA, are usually at least about 15 
nucleotides, more usually at least 20 nucleotides, preferably 30 nucleotides and more 
preferably may be 50 nucleotides or more. 

"Stringent conditions" are those that (1) employ low ionic strength and high 
• --temperature-for-wasta pH -7,2 f I mM- 

. EDTA pH 8.0 in 7% SDS at either 65 9 C or 55°C, or (2) employ during hybridization a 
denaturing agent such as formamide, for example, 50% (vol/vol) formamide with 0.1% 
bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 0.05 M sodium 
phosphate buffer at pH 6.5 with 0.75 M NaCl, 0.075 M sodium citrate at 42°C. A 
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specific example includes the use of 50% formamide, 5x SSC (0.75 M NaCl, 0.075 M 
sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5* 
Denhardt's solution, sonicated salmon sperm DNA (50 jig/ml), 0.1% SDS, and 10% 
dextran sulfate at 55°C, with washes at 55°C in 0.2* SSC and 0.1% SDS. A skilled 
artisan can readily determine and vary the stringency conditions appropriately to obtain 
a clear and detectable hybridization signal. Preferred molecules are those that hybridize 
under the above conditions to the complements of Sh2-Rev6, Sh2hs33, and Sh2-Rev6-HS 
and which encode a functional protein. 

The present invention utilizes nucleic acid molecules encoding the subject SH2 
mutant proteins, such as SH2-REV6, SH2HS33, and SH2-REV6-HS, which hybridize 
with nucleic acid molecules comprising sequences complementary to the subject 
polynucleotides encoding SH2-REV6, SH2HS33, and SH2-REV6-HS under conditions 
of sufficient stringency to produce a clear signal. As used herein, "nucleic acid" is 
defined as RNA or DNA encoding polypeptides of the invention, such as, for example, 
SH2-REV6, SH2HS33, and SH2-REV6-HS polypeptides, or RNA or DNA sequences 
which are complementary to nucleic acids encoding such peptides, or RNA or DNA 
sequences which hybridize to such nucleic acids and remain stably bound to them under 
stringent conditions, or RNA or DNA sequences which encode polypeptides sharing at 
least 60% sequence identity, or at least 65% sequence identity, or at least 70% sequence 
identity, or at least 75% sequence identity, or at least 80% sequence identity, or at least 
85% sequence identity, preferably at least 90% sequence identity, and more preferably 
at least 95% sequence identity with proteins of the present invention, such as SH2-REV6, 
SH2HS33, and SH2-REV6-HS. 

The present invention further provides fragments of any one of the encoding 
nucleic acids molecules. As used herein, a fragment of an encoding nucleic acid 
molecule refers to a small portion of the entire protein coding sequence. The size of the 
-fragment will be determined by the intended use. For example, if the fragment is chosen 
so as to encode an active portion of the protein, the fragment will need to be large enough 
to encode the functional region(s) of the protein. For instance, fragments of the invention 
encode the domains or regions of the SH2-REV6, SH2HS33, and SH2-REV6-HS of the 
present invention which are involved with the allosteric regulation of AGP. If the 
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fragment is to be used as a nucleic acid probe or PCR primer, then the fragment length 
is chosen so as to obtain a relatively small number of false positives during probing and 
priming. 

Fragments of the encoding nucleic acid molecules of the present invention (i.e., 
5 synthetic oligonucleotides) that are used as probes or specific primers for the polymerase 
chain reaction (PCR), or to synthesize gene sequences encoding proteins of the invention 
can easily be synthesized by chemical techniques, for example, the phosphotriester 
method of Matteucci et al (1981) J. Am. Chem. Soc. 103, 3185-3191) or using 
automated synthesis methods. In addition, larger DN A segments can readily be prepared 
10 by well known methods, such as synthesis of a group of oligonucleotides that define 
various modular segments of the gene, followed by ligation of oligonucleotides to build 
the complete modified gene. 

The encoding nucleic acid molecules of the present invention may further be 
modified so as to contain a detectable label for diagnostic and probe purposes. A variety 
15 of such labels are known in the art and can readily be employed with the encoding 
molecules herein described. Suitable labels include, but are not limited to, biotin, 
radiolabeled nucleotides and the like. A skilled artisan can employ any of the art known 
labels to obtain a labeled encoding nucleic acid molecule. 

Modifications to the primary structure itself by deletion, addition, or alteration of the 
20 amino acids incorporated into the protein sequence during translation can be made without 
destroying the activity of the protein. Such substitutions or other alterations result in proteins 
having an amino acid sequence encoded by a nucleic acid falling within the contemplated 
scope of the present invention. 

25 ffl. Isolation of Other Related Nucleic Acid Molecules 

As described herein, the identification and characterization of the nucleic acid 
~ molecules of the present invention, such as those encoding an* SH2-REV6;*SH2HS33; or 
* SH2-REV6-HS protein, or a fragment of an *SH2-REV6, SH2HS33, or*SH2-REV6-HS 
protein, allows a skilled artisan to isolate nucleic acid molecules that encode other members 
30 of the protein family in addition to the sequences herein described. Further, the presently 
disclosed nucleic acid molecules allow a skilled artisan to isolate nucleic acid molecules that 
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encode other members of the family of proteins in addition to the SH2-REV6, SH2HS33, and 
SH2-REV6-HS disclosed herein. 

Essentially, a skilled artisan can readily use any one of the amino acid sequences 
disclosed herein to generate antibody probes to screen expression libraries prepared from 
5 appropriate cells. Typically, polyclonal antiserum from mammals such as rabbits immunized 
with the purified protein or monoclonal antibodies can be used to probe a cDNA or genomic 
expression library to obtain the appropriate coding sequence for other members of the protein 
family. The cloned cDNA sequence can be expressed as a fusion protein, expressed directly 
using its own control sequences, or expressed by constructions using control sequences 
10 appropriate to the particular host used for expression of the enzyme. 

Alternatively, a portion of the coding sequence herein described can be synthesized 
and used as a probe to retrieve DNA encoding a member of the protein family from any 
organism. Oligomers containing approximately 1 8-20 nucleotides (encoding about a six to 
seven amino acid stretch) are prepared and used to screen genomic DNA or cDNA libraries 
15 to obtain hybridization under stringent conditions or conditions of sufficient stringency to 

eliminate an undue level of false positives. 

Additionally, pairs of oligonucleotide primers an be prepared for use in a polymerase 
chain reaction (PCR) to selectively clone an encoding nucleic acid molecule. A PCR 
denature/anneal/extend cycle for using such PCR primers is well known in the art and can 
20 readily be adapted for use in isolating other encoding nucleic acid molecules. 

IV. Production of Recombinant Proteins Using a Recom binant DNA frDNA^ Molecule 

The present invention further provides methods for producing polypeptides of the 
invention, such as SH2-REV6, SH2HS33, and SH2-REV6-HS using the nucleic acid 

25 molecules herein described. In general terms, the production of a recombinant form of a 
protein typically involves the following steps: First, a nucleic acid molecule is obtained that 
encodes, for example, an SH2-REV6, SH2HS33, and Sffi~REV6-HS protein, or a fragment 
of an SH2-REV6, SH2HS33, and SH2-REV6-HS protein. If the encoding sequence is 
uninterrupted by introns, it is directly suitable for expression in any host. The nucleic acid 

30 molecule is then preferably placed in operable linkage with suitable control sequences, as 
described above, to form an expression unit containing the protein open reading frame. The 
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expression unit is used to transform a suitable host and the transformed host is cultured under 
conditions that allow the production of the recombinant protein. Optionally the recombinant 
protein is isolated from the medium or from the cells; recovery and purification of the protein 
may not be necessary in some instances where some impurities may be tolerated. 

Each of the foregoing steps can be done in a variety of ways. For example, the 
desired coding sequences may be obtained from genomic fragments and used directly in 
appropriate hosts. The construction of expression vectors that are operable in a, variety of 
hosts is accomplished using appropriate replicons and control sequences, as set forth above. 
The control sequences, expression vectors, and transformation methods are dependent on the 
type of host cell used to express the gene and were discussed in detail earlier. Suitable 
restriction sites can, if not normally available, be added to the ends of the coding sequence 
so as to provide an excisable gene to insert into these vectors. A skilled artisan can readily 
adapt any host-expression system known in the art for use with the nucleic acid molecules of 
the invention to produce recombinant protein. 

V. SH2-REV6. SH2HS33. and SH2-REV6-HS Proteins 

As used herein, an SH2-REV6, SH2HS33, and SH2-REV6-HS protein refers to a 
protein that has the amino acid sequence encoded by the polynucleotide of SH2-REV6, 
SH2HS33, and SH2-REV6-HS, allelic variants thereof and conservative substitutions thereof 
that have SH2-REV6, SH2HS33, and SH2-REV6-HS activity. In addition, the polypeptides 
utilized in the present invention include the proteins encoded by SH2-REV6, SH2HS33, and 
SH2-REV6-HS, as well as polypeptides and fragments, particularly those which have the 
biological activity of SH2-REV6, SH2HS33, and SH2-REV6-HS and also those which have 
at least 65% sequence identity to the polypeptides encoded by SH2-REV6, SH2HS33, and 
SH2-REV6-HS or the relevant portion, or at least 70% identity, or at least 75% identity, or 
at least 80% identity, or at least 85% identity to the polypeptides encoded by SH2-REV6, 
SH2HS33 v and SH2-REV6-HS,^ 

sequence identity to the polypeptides encoded by SH2-REV6, SH2HS33, and SH2-REV6-HS 
or the relevant portion, and still more preferably at least 95% sequence identity to the 
polypeptides encoded by SH2-REV6, SH2HS33, and SH2-REV6-HS or the relevant portion, 
and also include portions of such polypeptides. One of skill will recognize whether an amino 
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acid sequence of interest is within a functional domain of a protein, such as the domain or 
region of the SH2-REV6, SH2HS33, and SH2-REV6-HS involved in the allosteric regulation 
of AGP. Thus, it may be possible for a homologous protein to have less than 40% homology 
over the length of the amino acid sequence but greater than 90% homology in one functional 
domain. 

The SH2-REV6, SH2HS33, and SH2-REV6-HS proteins utilized in the present 
invention include the specifically identified and characterized variants herein described as 
well as allelic variants, conservative substitution variants and homologues that can be 
isolated/generated and characterized without undue experimentation following the methods 
well known to one skilled in the art. 

The term "substantially pure" as used herein refers to polypeptides of the present 
invention, such as SH2-REV6, SH2HS33, and SH2-REV6-HS polypeptides, which are 
substantially free of other proteins, lipids, carbohydrates or other materials with which they 
are naturally associated. One skilled in the art can purify the subject polypeptides using 
standard techniques for protein purification. 

The invention also utilizes amino acid sequences coding for isolated polypeptides of 
the invention, such as the SH2-REV6, SH2HS33, and SH2-REV6-HS polypeptides. The 
polypeptides of the invention include those which differ from the exemplified SH2-REV6, 
SH2HS33, and SH2-REV6-HS proteins as a result of conservative variations. The terms 
"conservative variation" or "conservative substitution" as used herein denotes the replacement 
of an amino acid residue by another, biologically similar residue. Conservative variations or 
substitutions are not likely to change the shape of the polypeptide chain. Examples of 
conservative variations, or substitutions, include the replacement of one hydrophobic residue 
such as isoleucine, valine, leucine or methionine for another, or the substitution of one polar 
residue for another, such as the substitution of arginine for lysine, glutamic for aspartic acid, 
or glutamine for asparagine, and the like. Therefore, all conservative substitutions are 
included in the .invention as. long as .the subject polypeptides , encoded by the . nucleotide 
sequence are functionally unchanged or similar. 

As used herein, an isolated polypeptide of the present invention, such as an SH2- 
REV6, SH2HS33, and SH2-REV6-HS protein, can be a full-length or any homologue of such 
proteins, such as, for example, SH2-REV6, SH2HS33, and SH2-REV6-HS proteins in which 
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amino acids have been deleted (e.g., a truncated version of the protein, such as a peptide), 
inserted, inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, 
acetylation, myristoylation, prenylation, palmitoylation, amidation and/or addition of 
glycosylphosphatidyl inositol). Such modified proteins include those that retain at least one 
5 of the functional activities of the subject proteins or produce at least one of the physiological 
characteristics produced as a result of the expression of the subject proteins. A homologue 
of the subject proteins is a protein having an amino acid sequence that is sufficiently similar 
to the subject proteins, such as the SH2-REV6, SH2HS33, and SH2-REV6-HS protein amino 
acid sequences, that a nucleic acid sequence encoding the homologue is capable of 

10 hybridizing under stringent conditions to (i.e., with) a nucleic acid sequence encoding the 
subject proteins (e.g., SH2-REV6, SH2HS33, and SH2-REV6-HS protein amino acid 
sequences). Appropriate stringency requirements are discussed above. 

The subject protein homologues, including SH2-REV6, SH2HS33, and SH2-REV6- 
HS protein homologues, can be the result of allelic variation of a gene encoding the protein. 

15 For example, SH2-REV6, SH2HS33, and SH2-REV6-HS protein homologues can be 
produced using techniques known in the art including, but not limited to, direct modifications 
to a gene encoding a protein using, for example, classic or recombinant DNA techniques to 
effect random or targeted mutagenesis. 

Minor modifications of the primary amino acid sequence of a protein of the present 

20 invention may result in proteins which have substantially equivalent activity as compared to 
the subject proteins (e.g., SH2-REV6, SH2HS33, and SH2-REV6-HS) produced by the genes 
described herein. As used herein, a "functional equivalent" of a subject protein is a protein 
which possesses a biological activity or immunological characteristic substantially similar to 
a biological activity or immunological characteristic of the subject protein. * The term 

25 "functional equivalent" is intended to include the fragments, variants, analogues, homologues, 
or chemical derivatives of a molecule which possess the biological activity of proteins, such 
. as,. SH1-REV6, SH2HS33, and SH2-REV6-HS, encoded . by v .the . genes oflthe present 

invention ..... ... 

The terms "SH2-REV6, SH2HS33, and SH2-REV6-HS proteins," "SH2-REV6 

30 proteins," "SH2HS33 proteins " and "SH2-REV6-HS proteins" include all allelic variants of 
these proteins that possess normal SH2-REV6, SH2HS33, and SH2-REV6-HS activity. In 
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general, allelic variants of SH2-REV6, SH2HS33, and SH2-REV6-HS proteins will have 
slightly different amino acid sequence than that specifically encoded by the genes utilized in 
the present invention but will be able to produce the exemplified phenotypes. Allelic 
variants, though possessing a slightly different amino acid sequence than those recited 

5 individual a above, will posses the ability to produce a phenotype which exhibits increased 

individual and total seed weight, increased seed number, increased Harvest Index (HI) and 
increased above-ground plant mass. 

The methods of the present invention can be used by one skilled in the art to produce 
plants with increased individual and total seed weight, increased seed number, increased 

1 0 Harvest Index and increased total plant mass. 

Applicants further teach methods of recognizing variations in the DNA sequences of 
polynucleotides, such as Sh2-Rev6, Sh2hs33, and Sh2-Rev6-HS, of the present invention. One 
method involves the introduction of a nucleic acid molecule (also known as a probe) having 
a sequence complementary to, for example, an Sh2-Rev6, Sh2hs33, or Sh2-Rev6-HS gene, 

1 5 utilized in the invention under sufficient hybridizing conditions, as would be understood by 

those in the art. Another method of recognizing DNA sequence variation associated with 
polynucleotides of the present invention, including Sh2-Rev6, Sh2hs33, and Sh2-Rev6-HS, 
is direct DNA sequence analysis by multiple methods well known in the art. Another 
embodiment involves the detection of DNA sequence variation in the subject polynucleotides 

20 as represented by different plant genera, species, strains, varieties or cultivars. Polynucleotide 
sequences of the invention, for example, Sk2-Rev6, Sh2hs33, and Sh2-Rev6-HS, can be used 
as probes to detect the presence of corresponding genes in other plants. As discussed 
previously, Sh2-Rev6, Sh2hs33, and Sh2-Rev6-HS sequences have been determined and are 
readily available to one of ordinary skill in the art. In one embodiment, the sequences will 

25 bind specifically to one allele of an Sh2-Rev6, Sh2hs33, or Sh2-Rev6-HS gene, or a fragment 
thereof, and in another embodiment will bind to multiple alleles. Such detection methods 
include the polymerase chain reaction, restriction fragment length polymorphism (RFLP) 
analysis and single stranded conformational analysis. 

Diagnostic probes useful in such assays of the invention include antibodies to 

30 polypeptides of the present invention, such as SH2-REV6, SH2HS33, and SH2-REV6-HS. 

The antibodies may be either monoclonal or polyclonal, produced using standard techniques 
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well known in the art (See Harlow & Lane's Antibodies: A laboratory Manual . Cold Spring 
Harbor Laboratory Press, 1988). Antibodies can be used to detect a protein of the invention 
by binding to the protein and subsequent detection of the antibody-protein complex by 
ELISA, Western blot, or the like. Antibodies are also produced from peptide sequences of 
5 the subject proteins, such as SH2-REV6, SH2HS33, and SH2-REV6-HS, using standard 
techniques in the art (See Protocols in Immunology . John Wiley & Sons, 1994). Fragments 
of the monoclonals or the polyclonal antisera which contain the immunologically significant 
portion can also be prepared. 

Assays to detect or measure the subject polypeptides, for example, SH2-REV6, 

1 0 SH2HS33, and SH2-REV6-HS polypeptides, in a biological sample with an antibody probe 
may be based on any available format. For instance, in immunoassays where SH2-REV6, 
SH2HS33, or SH2-REV6-HS polypeptides are the analyte, the test sample, typically a 
biological sample, is incubated with anti-SH2-REV6, anti-SH2HS33, or anti-SH2-REV6-HS 
antibodies under conditions that allow the formation of antigen-antibody complexes. Various 

1 5 formats can be employed, such as "sandwich" assay where antibody bound to a solid support 
is incubated with the test sample; washed, incubated with a second, labeled antibody to the 
analyte; and the support is washed again. Analyte is detected by determining if the second 
antibody is bound to the support In a competitive format, which can be either heterogeneous 
or homogeneous, a test sample is usually incubated with an antibody and a labeled competing 

20 antigen, either sequentially or simultaneously. These and other formats are well known in the 
art 

VI. Transformation Methods 

Methods of producing transgenic plants are well known to those of ordinary skill in 
25 the art. Transgenic plants can now be produced by a variety of different transformation 

methods including, but not limited to, electroporation; microinjection; microprojectile 
' • 7 -bombardment, also known as particle, acceleration or biolistic bombardment; viralrmediated 

transformation; and Agrobacterium-mediated transformation (see, e.g., U.S. Patent Nos. 

5,405,765, 5,472,869, 5,538,877, 5,538,880, 5,550,318, 5,641,664, 5,736,369 and 5,736369; 
30 Watson et al (1992) Recombinant DNA . Scientific American Books; Hinchee et al (1988) 

Bio/Tech . 6:915-922 (1988); McCabe et al., Bio/Tech. 6:923-926; Toriyama et al (1988) 
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Biotech . 6:1072-1074; Fromm et al (1990) BfoOfiGh, 8:833-839; Mullins et al (1990) 
Bio/Tech. 8:833-839; and, Rained et al (1990) BioAEech. 8:33-38). 

A. Agrobacterium -Mediated Transformation 

Agrobacterium-mediated transformation is the most widely utilized method for 
introducing an expression vector into plants (Horsch et al (1985) Science 227:1229). A. 
tumefaciens and A. rhizogenes are plant pathogenic soil bacteria which genetically transform 
plant cells. The Ti and Ri plasmids of A, tumefaciens and A. rhizogenes, respectively, cany 
genes responsible for genetic transformation of the plant. (Kado,C.I. (1991) Crit Rev. Plant. 
Sci . 10:1). Descriptions of Agrobacterium vector systems and methods for 
Agrobacterium-mediated gene transfer are provided by Gruber et al (1993) "Vectors for 
Plant Transformation" in Methods in Plant Molecular Biology and Biotechnology, Glick, 

B. R. and Thompson, J. E. Eds. (CRC Press, Inc., Boca Raton), pages 89-119), Miki et al 
(1 993) 'Trocedures for Introducing Foreign DNA into Plants" in Methods in Plant Molecular 
Biology and Biotechnology . Glick, B. R. and Thompson, J. E. Eds. (CRC Press, Inc., Boca 
Raton) pages 67-88, and Moloney etal (1989) Plant Cell Report? 8:238. 

Agrobacterium-mediated transformation methods have been used principally to 
transform dicotyledonous plants. Agrobacterium-mediated transformation in dicotyledons 
facilitates the delivery of larger pieces of heterologous nucleic acid as compared with other 
transformation methods such as particle bombardment, electroporation, and polyethylene 
glycol-mediated transformation method. In addition, Agrobacterium-mediated transformation 
appears to result in relatively few gene rearrangements and more typically results in the 
integration of low numbers of gene copies into the plant chromosome. 

Monocotyledons are not a natural host of Agrobacterium. Although 
Agrobacterium-mediated transformation has been reported for asparagus (Bytebier et al 
(1QR7^ Pmc. Natl. Acad. Sci. USA 84:5354-5349) and for Dioscore bublifera (Schafer et al 
-(1987) Nature 327:529-532), it was generally believed that plants in the family Gramineae 
could not be transformed with Agrobacterium (Potrykus I. (1987) Biotechnology 8:535-543). 
However, recently in U.S. Patent No. 5,981,840, Zhao et al disclosed agrobacterium- 
mediated transformation in maize. The method of Zhao et al includes the following steps: 
contacting at least one immature embryo from a maize plant with Agrobacterium capable of 
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transferring at least one gene to said embryo; co-cultivating the embryos with Agrobacterium; 
culturing the embryos in medium comprising N6 salts, an antibiotic capable of inhibiting the 
growth of Agrobacterium, and a selective agent to select for embryos expressing the gene; 
and regenerating plants expressing the gene. 

5 

B. Microprojectile-Mediated Transformation 

In a microprojectile bombardment process, also referred to as a biolistic process, the 
transport of the DNA is mediated by very small particles of a biologically inert material. 
When the inert particles are coated with DNA and accelerated to a suitable velocity, one or 

1 0 more of the particles is able to enter into one or more of the cells where the DNA is released 
from the particle and expressed within the cell. While some of the cells are fatally damaged 
by the bombardment process, some of the recipient cells do survive, stably retain the 
introduced DNA, and express it. Sanford et al provides a general description of a suitable 
particle bombardment instrument (Sanford et al (1987) Particulate Sci. Technol . 5: 27-37). 

1 5 Microprojectile bombardment process has been used to successfully introduce genes 

encoding new genetic traits into a number of plants, including onion, cotton, maize, tobacco, 
rice, wheat, sunflowers, soybeans and certain vegetables (U.S. Pat. No. 4,945,050; Sanford 
et cL (1988) Trends in Biotechnology 6:299; Sanford et al (1988) Part. Sci. Technol. 5:27; 
J. J. Finer and M. D. McMullen (1990) Plant Cell Reports 8:586-589; and Gordon-Kamm 

20 (1990) The Plant Cell 2:603: Klein et al (1988) Proc. Nat. Acad. Sci. USA 85:4305^309). 

Although transformation by microprojectile bombardment is less species and genotype 
specific than transformation with Agrobacterium, the frequencies of stable transformation 
events achieved following bombardment can be quite low, partly due to the absence of a 
natural mechanism for mediating the integration of a DNA molecule or gene responsible for 

25 a desired phenotypic trait into the genomic DNA of a plant. Particle gun transformation of 
cotton for example, has been reported to produce no more than one clonal transgenic plant 
per 100-500 mmMerris targeted for transformation: Only 0.1 to 1% of these ti^sfdrmants 
were capable of tfeismittirig "foreign DNA to progeny (WO* 92/1 5675). Cells treated by 
particle bombardment must be regenerated into whole plants, which requires labor intensive, 

30 sterile tissue culture procedures and is generally genotype dependent in most crop plants, 
particularly so in cotton. Similar low transformation frequencies have been reported for other 
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plant species as well. Other disadvantages of microprojectile bombardment include the 
inability to control the site of wounding of a plant tissue and thus, the site to which the 
transforming agent is delivered. The inability to target germline tissues accounts in part for 
the low transformation efficiencies achieved by microprojectile bombardment. Additionally, 
5 bombardment frequently results in the delivery of more than one copy of the transforming 

DNA or gene into the genome of the transformed plant cell, which can have deleterious 
effects on the regenerated, transformed plant. Fragmentation of the DNA to be inserted can 
also occur when transforming DNA via microprojectile bombardment, resulting in transgenic 
plants with only a portion of the gene that is being inserted. 

1 0 Attempts have been made to improve the efficiency of microprojectile bombardment. 

For example, EPA 0486 233 describes treating bombarded tissues with Agrobacterium 
carrying the gene of interest. It is thought that the high velocity impact of the dense 
microprojectile particles generates an array of microwounds creating an environment that is 
particularly conducive to infection by the Agrobacterium. However, the transformed plant 

15 cells must still be regenerated into whole plants, and the fertile, stably transformed plants 
must be selected from the total population of regenerated plants. Organogenesis and somatic 
embryogenesis have been used to regenerate plants. Nonetheless, organogenesis frequently 
produces chimeric plant containing both transformed and nontransformed cells, and somatic 
embryogenesis, although superior to organogenesis is highly genotype dependent in most 

20 crop plants. 

Efforts have been made to deliver the transforming agent or DNA to germline tissues 
such that the agent or DNA will be incorporated directly into the DNA of the cells in these 
tissues, particularly into the DNA of the egg cells of the plant. In U.S. Patent No. 5,994,624, 
Trolinder etal. describes a method of implanta transformation which provides an improved 

25 method for delivering transforming agents to plant tissues. The method uses a 
needleless-injection device that is capable of injecting a small high pressure stream of a 

^ * solution through the many cell layers of plant tissue.- The transfonning agent is delivered to 
a plant's floral tissues, thereby facilitating delivery of a transforming agent comprising a gene 
of interest into germline cells of the plant. The high pressure stream provided by the injection 

30 device insures that the Agrobacterium culture or the DNA solution penetrates the many cell 
layers of the plant floral tissue without causing massive tissue damage, such as that caused 
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by direct injection with a syringe having a needle or by particle bombardment. The method 
can be used to transform plant cells and tissues, including embryonic tissue culture cells, 
meristematic tissues and plant callus, which can be regenerated into whole plants. Moreover, 
the method can be used to transform plant cells and tissues selected from the group consisting 
5 of cotton, soybean, alfalfa, flax, tobacco, sunflower, peanut, strawberry, tomato, pea, bean, 
squash, pepper, maize, sorghum, barley, oat, rye, wheat, rice, brassica, and potato. 

Although Klein et al (Klein et al (1988) Proc. Nat. Acad. Sci. USA 85:4305^309; 
Klein et al (1988) Bio/Technol . 6:59-563; Klein et al (1989) Plant Physiol . 91:440-444) 
provides protocols for bombardment of maize non-regenerable suspension culture cells, no 

1 0 protocols have been published for the bombardment of callus cultures or regenerable maize 
cells until recently. Lundquist etal (U.S. Patent No. 6,013,863) describes delivery of DNA 
into regenerable maize callus cultures via particle bombardment process which results in high 
level of viability for a few transformed cells. The method maybe applicable to producing 
fertile stably transgenic plants of other graminaceous cereals. Dwigjit et al (U.S. Patent No. 

1 5 5,990,387) discloses a method of producing fertile, stably transformed, Zea mays plant. The 
methods comprise the following steps: providing a foreign DNA comprising an expression 
vector carrying a gene encoding an agronomic trait; providing a maize embryogenic callus, 
suspension culture, or immature embryo isolated from a plant; introducing the foreign DNA 
into the embryogenic callus, suspension culture or immature embryo isolated from a plant by 

20 one or more microparticle bombardments; and regenerating fertile transgenic Zea mays plant. 
Plants that can be successfully transformed by the method of Dwight et al include maize, rye, 
barley, wheat, sorghum, oats, millet, rice, sunflower, alfalfa, rape seed and soybean. 

Biswas et al describes generation of transgenic rice plants by microprojectile 
bombardment of embryogenic cell clusters (Biswas et al ( 1 998) Plant Science . 1 33 :203-2 1 0), 

25 and Yao et al discloses the production of transgenic barley plants via direct delivery of 
plasmid DNA into microspores of barley using high velocity microprojectiles (Yao et al 

, . r ^ . (1997) Genome. 40:570-581). . Christou et <a/ ; . reports on the parameters =thatinfluence stable 

- transformation of rice embryogenic callus and the recovery of transgenic plants using electric r - 
discharge particle acceleration (Christou et al (1995) Annals of Botany 75:407-413). 



30 
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C. Alternative Methods of Transformation 

Other methods for physical delivery of DNA to plants include sonication of target 
cells (Zhang et ai (1991) Bio/Technologv 9:996) and liposome or spheroplast fusion 
(Deshayes et al (1985) EMBOI. 4: 273 1 , Christou et al (1987) ProcNatl. Acad. Sci. USA 
5 84: 3962). Direct uptake of DNA into protoplasts using CaCl 2 precipitation, polyvinyl 

alcohol or poly-L-ornithine has also been reported (Hain et al (1 985) Mol. Gen. Genet . 1 99: 
161 and Draper et al (1982) Plant Cell PhvsioL 23: 451). Nobre et al reports the 
regeneration of fertile transgenic plants of barley using PEG-mediated transformation of 
scutellum protoplast (Nobre et al (1997) Barlev Genetics Newsletter . 27:16-17). 

1 0 Electroporation of protoplasts and whole cells and tissues has also been described (Dorm et 
al ( 1 990) Abstracts of Vllth International Congress on Plant Cell and Tissue Culture IAPTC . 
A2-38, p 53; D'Halluin et al (1992) Plant Cell 4: 1495-1505 and Spencer et al (1994) Plant 
Mol. Biol . 24: 51-61). In fact, D'Halluin et al (U.S. Patent No. 6,002,070) describes a rapid 
and efficient method of transforming monocotyledonous plants by electroporation. The 

1 5 method of D'Halluin comprises electroporation of DNA of interest into either intact tissue 
capable of forming compact embryogenic callus or compact embryogenic callus obtained 
from intact tissue. 

Another technology for production of transgenic plants is whisker-mediated 
transformation whereby certain materials, when incubated with plant tissue, facilitate entry 
20 of DNA molecules into plant cells. It has been proposed that such materials that promote 
DNA uptake, primarily silicone carbide, do so by damaging the cell surface. For a review, 
see Wang et al (1995) In Vitro Cell Dev. Biol 34: 101-4. 

VII. Transgenes 

25 Genes successfully introduced into plants using recombinant DNA methodologies 

include, but are not limited to, those coding for the following traits: seed storage proteins, 
including modified 7S legume seed storage proteins (U.S. Patent Nos. 5,508,468, 5,559,223 
and 5,576,203); herbicide tolerance or resistance (U.S. Patent Nos. 5,498,544 and 5,554,798;. 
Powell et al (1986) Science 232:738-743; Kaniewski et al (1990) BioyTech. 8:750-754; Day 

30 etal (199H Proc. Natl. Acad. Sci. USA 88:6721-6725^: phytase (U.S. Patent No. 5,593,963); 

resistance to bacterial, fungal, nematode and insect pests, including resistance to the 
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lepidoptera insects conferred by the Bt gene (U.S. Patent Nos. 5,597,945 and 5,597,946; 
Hilderer al Nature 330:160-163; Johnson etal ri989) Proc. Natl. Acad. Sci. USA 86:9871- 
9875; Perlak et al (1990) Bio/Tech . 8:939-943); lectins (U.S. Patent No. 5,276,269); and 
flower color (Meyer et al ( 1 987) Nature 330:677-678; Napoli et al( 1990) Plant Cell 2:279- 
289 (1990); van der Krol et al (1990) Plant Cell 2:291-299). 

VEl. Expression Units to Express Exogenous DNA in a Plant 

The present invention further provides host cells transformed with a nucleic acid 
molecule that encodes a protein of the present invention. The host cell can be either 
prokaryotic or eukaryotic. Eukaryotic cells useful for expression of a protein of the invention 
are not limited, so long as the cell line is compatible with cell culture methods and compatible 
with the propagation of the expression vector and expression of the gene product. Preferred 
eukaryotic host cells include any plant species. 

Any prokaryotic host can be used to express a rDNA molecule encoding a protein 
of the invention. The preferred prokaryotic host is E. coll 

Transformation of appropriate cell hosts with a rDNA molecule of the present 
invention is accomplished by well known methods that typically depend on the type of vector 
used and host system employed. With regard to transformation of prokaryotic host cells, 
electroporation and salt treatment methods are typically employed, see, for example, Cohen 
et al (\91T\ Proc. Natl. Acad. Sci. USA 69:21 10-21 14; and Maniatis et al (1982) Molecular 
Cloning - A Laboratory Manual. Cold Spring Harbor Laboratory Press. With regard to 
transformation of vertebrate cells with vectors containing rDNAs, electroporation, cationic 
lipid or salt treatment methods are typically employed, see, for example, Graham et al (1973) 
Virology 52:456-467; and Wigler etal (1979) Proc. Natl Acad. Sci. USA 76:1373-1376. 

Successfully transformed cells, i.e., cells that contain a rDNA molecule of the present 
invention, can be identified by well known techniques including the selection for a selectable 
marker. For example,_ cel]s resulting from the introduction of an rDNA of the present 
invention can be cloned to produce single colonies. Cells from those colonies can be 
harvested, lysed and their DNA content examined for the presence of the rDNA using a 
method such as that described by Southern, (1975) J. Mol. Biol. 98:503-5 17; or Berent et al 
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(1985) Biotech. Histochem. 3:208; or the proteins produced from the cell assayed via an 
immunological method. 

As provided herein elsewhere, several embodiments of the present invention employ 
expression units (or expression vectors or systems) to express an exogenously supplied 
5 nucleic acid sequence, such as the sequence coding for SH2-REV6, SH2HS33, and SH2- 
REV6-HS protein in a plant Methods for generating expression units/systems/vectors for use 
in plants are well known in the art and can readily be adapted for use in expressing 
polynucleotide sequences encoding proteins of the present invention, such as SH2-REV6, 
SH2HS33, and SH2-REV6-HS proteins, in a plant cell. A skilled artisan can readily use any 
10 appropriate plant/vector/expression system in the present methods following the outline 
provided herein. 

The expression control elements used to regulate the expression of the protein can 
either be the expression control element that is normally found associated with the coding 
sequence (homologous expression element) or can be a heterologous expression control 

1 5 element. A variety of homologous and heterologous expression control elements are known 
in the art and can readily be used to make expression units for use in the present invention. 
Transcription initiation regions, for example, can include any of the various opine initiation 
regions, such as octopine, mannopine, nopaline and the like that are found in the Ti plasmids 
of Agrobacterium tumefaciens. Alternatively, plant viral promoters can also be used, such 

20 as the cauliflower mosaic virus 35S promoter to control gene expression in a plant. Lastly, 
plant promoters such as prolifera promoter, fruit-specific promoters, Ap3 promoter, heat 
shock promoters, seed-specific promoters, etc. can also be used. The most preferred 
promoters will be most active in seedlings. 

Either a constitutive promoter (such as the CaMV or Nos promoter), an organ-specific 

25 promoter (such as the E8 promoter from tomato) or an inducible promoter is typically ligated 
to the protein or antisense encoding region using standard techniques known in the art. The 
expression unit may be further optimized by employing supplemental elements such as 
transcription terminators and/or enhancer elements. 

Thus, for expression in plants, the expression units will typically contain, in addition 

30 to the protein sequence, a plant promoter region, a transcription initiation site and a 
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transcription termination sequence. Unique restriction enzyme sites at the 5' and 3' ends of 
the expression unit are typically included to allow for easy insertion into a preexisting vector. 

In the construction of heterologous promoter/structural gene or antisense 
combinations, the promoter is preferably positioned about the same distance from the 
5 heterologous transcription start site as it is from the transcription start site in its natural 
setting. As is known in the art, however, some variation in this distance can be 
accommodated without loss of promoter function. 

In addition to a promoter sequence, the expression cassette can also contain a 
transcription termination region downstream of the structural gene to provide for efficient 

10 termination. The termination region may be obtained from the same gene as the promoter 
sequence or may be obtained from different genes. If the mRNA encoded by the structural 
gene is to be efficiently processed, DNA sequences which direct polyadenylation of the RNA 
are also commonly added to the vector construct. Polyadenylation sequences include, but are 
not limited to the Agrobacterium octopine synthase signal (Gielen et al (1984) EMBO J 

15 3:835-846) or the nopaline synthase signal (Depicker et al (1982) Mol. and Appl. Genet 1 : 
561-573). 

The resulting expression unit is ligated into or otherwise constructed to be included 
in a vector which is appropriate for higher plant transformation. The vector will also typically 
contain a selectable marker gene by which transformed plant cells can be identified in culture. 
20 Usually, the marker gene will encode antibiotic resistance. These markers include resistance 
to G418, hygromycin, bleomycin, kanamycin, and gentamicin. After transforming the plant 
cells, those cells having the vector will be identified by their ability to grow on a medium 
containing the particular antibiotic. Replication sequences, of bacterial or viral origin, are 
generally also included to allow the vector to be cloned in a bacterial or phage host, 
25 preferably a broad host range prokaryotic origin of replication is included. A selectable 
marker for bacteria should also be included to allow selection of bacterial cells bearing the 
^ — desired construct. Suitable prokaryotic selectable markers., also -include, resistance to 
- antibiotics such as kanamycin or tetracycline. - 

Other DNA sequences encoding additional functions may also be present in the 
30 vector, as is known in the art. For instance, in the case of Agrobacterium transformations, 
T-DNA sequences will also be included for subsequent transfer to plant chromosomes. 
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The polynucleotide sequences of the subject invention, such as the Sh2-Rev6, 
Sh2hs33, and Sh2-Rev6-HS sequences, utilized in the present invention can also be fused to 
various other nucleic acid molecules such as Expressed Sequence Tags (ESTs), epitopes or 
fluorescent protein markers. 
5 ESTs are gene fragments, typically 300 to 400 nucleotides in length, sequenced from 

the 3' or 5' end of complementary-DNA (cDNA) clones. Nearly 30,000 Arabidopsis thaliana 
ESTs have been produced by a French and an American consortium (Delseny et al. (1997) 
FEBS Lett. 405(2): 129-132; Arabidopsis thaliana Database, 
http://genome.www.stanford.edii/Arabidopsis). For a discussion of the analysis of gene- 

10 expression patterns derived from large EST databases, see, e.g., M. R. Fannon (1996) 
TEBTECH 14:294-298. 

Biologically compatible fluorescent protein probes, particularly the self-assembling 
green fluorescent protein (GFP) from the jellyfish Aequorea victoria, have revolutionized 
research in cell, molecular and developmental biology because they allow visualization of 

15 biochemical events in living cells (Murphy etal. (1997) Cuir. Biol . 7(1 1):870-876; Grebenok 
et al (1997) Plant J . 1 l(3):573-586; Pang et al. (1996) Plant Physiol. 1 12(3); Chiu et al. 
(1996) Cun. Biol . 6(3):325-330; Plautz et al, (1996) Gene 173(l):83-87; Sheen et al (1995) 
PlanLL8(5):777-784). 

Site-directed mutatgenesis has been used to develop a more soluble version of the 

20 codon-modified GFP call soluble-modified GFP (smGFP). When introduced into 
Arabidopsis, greater fluorescence was observed when compared to the codon-modified GFP, 
implying that smGFP is 'brighter' because more of it is present in a soluble and functional 
form (Davis et al. (1998) Plant Mol. Biol . 36(4):52 1-528). By fusing genes encoding GFP 
and beta-glucuronidase (GUS), researchers were able to create a set of biflinctional reporter 

25 constructs which are optimized for use in transient and stable expression systems in plants, 
including Arabidopsis (Quaedvlieg etal (1998) Plant Mol. Biol 37(4):7 15-727). 

.;. Berger et al (Berg etal 09981 Dev. BioL 194(2):226^234) report the isolation of a 
GFP marker line for Arabidopsis hypocotyl epidermal cells. GFP-fusion proteins have been r 
used to localize and characterize a number of Arabidopsis genes, including geranylgeranyl 

30 pyrophosphate (GGPP) (Zhu et al (1997) Plant Mol. Biol . 35(3):331-341). 
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IX. folding Methods 

Open-Pollinated Populations . The improvement of open-pollinated populations of 
such crops as rye, many maizes and sugar beets, herbage grasses, legumes such as alfalfa and 
clover, and tropical tree crops such as cacao, coconuts, oil palm and some rubber, depends 

5 essentially upon changing gene-frequencies towards fixation of favorable alleles while 
maintaining a high (but far from maximal) degree of heterozygosity. Uniformity in such 
populations is impossible and trueness-to-type in an open-pollinated variety is a statistical 
feature of the population as a whole, not a characteristic of individual plants. Thus, the 
heterogeneity of open-pollinated populations contrasts with the homogeneity (or virtually so) 

1 0 of inbred lines, clones and hybrids. 

Population improvement methods fall naturally into two groups, those based on 
purely phenotypic selection, normally called mass selection, and those based on selection 
with progeny testing. Interpopulation improvement utilizes the concept of open breeding 
populations; allowing genes for flow from one population to another. Plants in one 

1 5 population (cultivar, strain, ecotype, or any germplasm source) are crossed either naturally 
(eg., by wind) or by hand or by bees (commonly Apis mellifera L. or Megachile roiundata 
F.) with plants from other populations. Selection is applied to improve one (or sometimes 
both) populations) by isolating plants with desirable traits from both sources. 

There are basically two primary methods of open-pollinated population improvement. 

20 First, there is the situation in which a population is changed en masse by a chosen selection 
procedure. The outcome is an improved population which is indefinitely propagable by 
random-mating within itself in isolation. Second, the synthetic variety attains the same end 
result as population improvement but is not itself propagable as such; it has to be 
reconstructed from parental lines or clones. These plant breeding procedures for improving 

25 open-pollinated populations are well known to those skilled in the art and comprehensive 
reviews of breeding procedures routinely used for improving cross-pollinated plants are 
■ . „ .provided in, numerous ..texts, and articles,, including; Allard, (I960).. Principles of -Plant 
Breeding . John Wiley & Sons, Inc.; Simmonds (1979^ Principles of Crop Improvement 
Longman Group Limited; Hallauer and Miranda (1981) Quantitative Genetics in Maize 

30 Breeding , Iowa State University Press; and, Jensen ( 1 988) Plant Breeding Methodology . John 
Wiley & Sons, Inc. 
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Mass Selection . In mass selection, desirable individual plants are chosen, harvested, 
and the seed composited without progeny testing to produce the following generation. Since 
selection is based on the maternal parent only, and their is no control over pollination, mass 
selection amounts to a form of random mating with selection. As stated above, the purpose 
of mass selection is to increase the proportion of superior genotypes in the population. 

Synthetics . A synthetic variety is produced by crossing inter se a number of 
genotypes selected for good combining ability in all possible hybrid combinations, with 
subsequent maintenance of the variety by open pollination. Whether parents are (more or less 
inbred) seed-propagated lines, as in some sugar beet and beans (Vicia) or clones, as in 
herbage grasses, clovers and alfalfa, makes no difference in principle. Parents are selected 
on general combining ability, sometimes by test crosses or topcrosses, more generally by 
polycrosses. Parental seed lines may be deliberately inbred (e.g., by selfing or sib crossing). 
However, even if the parents are not deliberately inbred, selection within lines during line 
maintenance will ensure that some inbreeding occurs. Clonal parents will, of course, remain 
unchanged and highly heterozygous. 

Whether a synthetic can go straight from the parental seed production plot to the 
farmer or must first undergo one or two cycles of multiplication depends on seed production 
and the scale of demand for seed. In practice, grasses and clovers are generally multiplied 
once or twice and are thus considerably removed from the original synthetic. 

While mass selection is sometimes used, progeny testing is generally preferred for 
polycrosses, because of their operational simplicity and obvious relevance to the objective, 
namely exploitation of general combining ability in a synthetic. 

The number of parental lines or clones that enter a synthetic vary widely. In practice, 
numbers of parental lines range from 10 to several hundred, with 100-200 being the average. 
Broad based synthetics formed from 100 or more clones would be expected to be more stable 
during seed multiplication than narrow based synthetics. 
.- ■ - - Hybrids . A hybrid is an individual plant resulting from a cross between parents of 
differing genotypes. Commercial hybrids are now used extensively in many crops, including 
com (maize), sorghum, sugarbeet, sunflower and broccoli. Hybrids can also be produced in 
wheat and rice. Hybrids can be formed a number of different ways, including by crossing two 
parents directly (single cross hybrids), by crossing a single cross hybrid with another parent 
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(three-way or triple cross hybrids), or by crossing two different hybrids (four-way or double 
cross hybrids). 

Strictly speaking, most individuals in an outbreeding (i.e., open-pollinated) population 
are hybrids, but the term is usually reserved for cases in which the parents are individuals 

5 whose genomes are sufficiently distinct for them to be recognized as different species or 
subspecies. Hybrids may be fertile or sterile depending on qualitative and/or quantitative 
differences in the genomes of the two parents. Heterosis, or hybrid vigor, is usually 
associated with increased heterozygosity which results in increased vigor of growth, survival, 
and fertility of hybrids as compared with the parental lines which were used to form the 

1 0 hybrid. Maximum heterosis is usually achieved by crossing two genetically different, highly 
inbred lines. 

The production of hybrids is a well-developed industry, involving the isolated 
production of both the parental lines and the hybrids which result from crossing those lines. 
For a detailed discussion of the hybrid production process, see, e.g., Wright, Commercial 
15 Hybrid Seed Production 8:161-176. In Hybridization of Corp Plants , supra. 

X. Seed Number. Grain Yield, and Sink Capacity in Wheat 

Wheat seed number and subsequent grain yield is affected by competition between 
inflorescences (Whingwiri et aL, 1981). Wheat yield is always lower than ears potential due 
20 to lack of assimilate supply or competition among florets limiting seed size and/or number 
(Zamski and Grunberger, 1995). Healthy, well-grown wheat plants always produce more 
shoots (potential heads) and florets (potential seeds) than heads and seeds. A significant 
- factor controlling seed number is sink strength of the developing seeds (Thome and Wood, 
1987). A review of this area (Evans et aL, 1975) indicates for wheat in many cases, yield is 
25 limited by the sink capacity of developing seeds. The limitations imparted by low sink 
strength may be seen as reduced grain set, reduced number of wheat heads, and reduced 
. ^ individual seed we 

. developing heads .determines, the survival of initiated florets, and plays a significant role in 
determining final grain number (Spieitz and vanKeulen, 1980; Abbate et aL, 1998). Possibly 
30 the most effective method of increasing kernel number in wheat would be to modify 
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assimilate flow to developing kernels (Bindraban et al, 1998). The transgenic wheat of the 
present invention which contains increased sink strength confirms this hypothesis. 

All patents, patent applications, provisional applications, and publications referred to 
5 or cited herein are incorporated by reference in their entirety to the extent they are not 
inconsistent with the explicit teachings of this specification. 

Materials and Methods 
L Production of Transgenic Plants 
10 Vectors according to the invention may be used to transform plants as desired, to 

make plants according to the invention as discussed elsewhere herein. 

Wheat Transformation . The methods described by Weeks et al (1993) and Vasil et 
al (1993) have been adopted with minor modifications for transforming the wheat cultivar 
'Hi-Line' (Lanning et al, 1992). The technique as routinely practiced initially utilizes 
1 5 immature embryos isolated from wheat cultivars approximately 7 days post anthesis. 

The Biolistic PDS-1000 He (Bio-Rad laboratories, USA) device was used for 
transforming the wheat tissues via microprojectile bombardment. 

For wheat calli 1500 psi rupture discs were used. Other procedures such as 
sterilization of the rupture discs, macrocairiers, stopping screens etc., were strictly in 
20 accordance with the manufacturer's manual. 

Rice Transformation . The methods described by Sivamani et al (1996) may be 
adopted for transforming rice cultivar 'M202' (Johnson et al 1986). The technique as 
routinely practiced initially utilizes embryogenic calli cultured from mature seeds. 

The Biolistic PDS-1000 He (Bio-Rad laboratories, USA) device is used for 
25 transforming the rice tissues via microprojectile bombardment. 

For rice calli 1500 psi rupture discs are used. Other procedures such as sterilization 
of the.rupture discs, macrocairiers, stopping screens etc., are strictly in accordance with the 
- manufacturer's manual. 

Pea Transformation . The methods described by U.S. Patent No. 5,286,635 (Example 
30 9) and U.S. Patent No. 5,773,693 (Example V) may be adopted with minor modifications for 
transforming the pea {Pisum sativum L.) cultivar Tea Green Arrow' (available commercially 
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from Paik Seed®). Pea explant material is transformed by incubation with Agrobacterium 
cells carrying Sh2-Rev6-HS sequence. The pea explant is preferably obtained from the 
plumule of a pea seed, and transformed shoots are preferably induced directly in the explant 
material without passage through a callus phase. Whole transformed pea plants may be 
5 regenerated from the transformed shoots by rooting and subsequent planting in the soil The 
exogenous Sh2-Rev6-HS DNA will be stably incorporated into the chromosomes of the 
regenerated Tea Green Arrow' plant which will be able to express the gene. 

II. Pl^mids 

1 0 Wheat The plasmid DNA pRQ 1 0 1 containing the coding sequence of the Bar gene 

(Fromm et ai, 1990) under control of the CaMV 35S promotor with AdhI intron and NOS 
terminator was used as selectable marker for selecting transgenic wheat tissue. 

Rice . As a selectable marker for rice, the plasmid DNA pILTAB222 containing the 
coding sequence of the hygromycin B phosphotransferase under the control of the maize 
1 5 ubiquitin promoter was used (Sivamani et aL, 1 996). 

Pea . As a selectable marker for pea, the coding sequence of cefotoxime resistance 
may be used according to U.S. Patent No. 5,773,693. This anti- Agrobacterium antibiotic may 
be used in the selection and regeneration medium (500 mg/1) used for growing the pea callus. 
General . The marker genes (i.e., Bar, hygromycin resistance, or cefotaxime) were on 
20 different construct than Sh2-Rev6-HS genes. 

For the introduction of the Sh2-Rev6-HS genes into cereals, plasmid pSh2-Rev6-HS 
were created. Besides containing Sh2-Rev6-HS cDNA, the plasmid also contained the Sh2 
promoter, Shi first intron, and NOS terminator (Rogers et aL, 1987). Specifically, plasmid 
pSh2-Rev6-HS contains the following nucleotide fragments linked in the 5 ! to 3' direction: 
25 nucleotides -1084 to +36 of the Sh2 promoter, 8 nucleotides of polylinker; two C's; 
nucleotides of the Shi intron 1 cassette containing nucleotides +43 to +52 of Shi exon 1, 

- -nucleotides +53 to +1080 ofSW-mtron Aandnucleotides +1081 to.-l097.af 5A/ exon 2;-one 

C; 13 nucleotides of polylinker containing a BamHl restriction site; cDN A encoding Sh2- 
Rev6-HS (SEQ ID NO: 3); 18 nucleotides polylinker containing Kpnl and SstI restriction 
30 sites; and nucleic acid of the NOS terminator. The nucleic acid sequence of the Sh2 promoter 
is disclosed by Shaw and Hannah, (1992), Plant Physiology . 98:1214-1216. The sequence 
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numbering of Shi intron cassette is shown in Zack et al (1986) Maydjca, 31, 5-16, and the 
effects of the Shi intronl cassette on transient gene expression are described by Clancy et 
al ((1994) Plant Science. 98, 151-161) and Vasil et al ((1989), Plant Science, 91, 1575- 
1579). The 3 additional C's (2 at the 5' end and 1 at the 3* end) are subcloning derived 

5 nucleotides. The plasmid includes transit peptide and consensus start site. Plasmid pSh2- 
Rev6-HS used in the present application is provided by Florida State University. 

For introduction of Sh2-Rev-HS into dicots such as pea, the above plasmid is 
modified such that the Sh2 promoter is replaced with a dicot seed specific promoter such as 
pea vicilin promoter (U.S. Patent No. 5,773,693). Other suitable promoters and/or constructs 

10 for expression of Sh2-Rev6-HS in dicots are well known to the skilled artisan (see, e.g., U.S. 
Patent No. 5,773,693). 



HI. Selection and Regenerati on of Transgenic Plants 

Wheat . Transgenic wheat plants were obtained from bombarded immature embryos 
15 by the methods described by Weeks et al. (1993) and Vasil et al (1993) using bialaphos 
(Meiji Seika Kaisha Ltd, Japan) selection. The resistant calli of wheat are transferred to 
medium to induce production of both shoots and roots. 

Rice . Transgenic rice plants were obtained from the bombarded embryogenic calli 
of rice by the technique of Sivamani et al (1996) using hygromycin selection. The resistant 
20 calli of rice are transferred to medium to induce production of both shoots and roots. 

Peas . Transgenic pea plants may be obtained from Agrobacterium-transformed calli 
of pea explants by the method of U.S. Patent No. 5,773,693 using cefotaxime selection. 

Pea shoots may be rooted by transfer to Sorbarod plugs (Baumgartnen Papiers SA, 
Switzerland) and soaked in liquid YRM according to U.S. Patent No. 5,773,693 (Example 
25 V). 

General . Putative transgenic plantlets were transferred to the greenhouse and allowed 
to self-fertilize. . For wheat, typically more than 75% of these piantl_ets_ are , escapes_and true 
_ transgenic plants were selected by spraying the plants with 0.1% glufosinate. (Liberty®, 
Agrevo Inc.). 



30 
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IV. P rints for PCR 

An Sh2 specific primer and a NOS specific primer for PCR were used to confirm the 
presence of S/j2-/?ev6-//S' transgene in the transgenic plants. The 5* primer was MC4Sh2, a 
26-mer which is specific to Sh2 sequences in the construct: 

5' CTG GAT GTG AAC TCA AGG ACT CCG TG 3* (SEQ ID NO: 5). 

The 3' primer was MC35PUC19, a 24-mer specific to the puc backbone of the 
construct: 

5' GGC TTA ACT ATG CGG CAT CAG AGC 3' (SEQ ID NO: 6). 
The primers produce a PCR product of 826 bp (309 bp of Sh2 cDNA, 260 bp of 
NOS, and 257 bppUC19). 

Following are examples which illustrate procedures for practicing the invention. 
These examples should not be construed as limiting. All percentages are by weight and all 
solvent mixture proportions are by volume unless otherwise noted. 

Example 1— Genetic Analysis of Transgenic Wheat Plants 

The initial pool of wheat transformants yielded a number of independent 
transformants which were transgenic for Sh2-Rev6-HS and/or basta resistance. 

The T 0 plants were allowed to set seed and mature in the greenhouse under controlled 
conditions. 

The selected wheat transformants were analyzed by PCR for the presence of the 
introduced transgene and for T t seed segregation data for basta resistance. 

PCR screening of transgenic wheat plants utilized MC4Sh2 and MC35PUC19 
(primer sequences given above) for the presence of Sh2-Rev6-HS in genomic DNA samples 
prepared from leaf tissue using standard PCR protocols. 

Twenty seven independent lines of transgenic wheat were tested. All 27 transgenic 
lines tested positive for basta resistance. Fifteen of the 27 transgenic lines tested positive for 
the presence of the Sh2-Rev6-HS transgene and the other 12 did not test positive for the 
presence of the Sh2-Rev6-HS transgene. 
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f frample 2 — Phenotypic Analysis of the Transgenic Wheat Plants 

Various phenotypic traits were collected and analyzed for each of the 27 transgenic 
wheat plants grown in a greenhouse. As mentioned previously, all 27 transgenic lines carried 
the herbicide resistance gene. These traits included the following: number of seeds per plant 
5 (Seeds/Plant); individual seed weight (Individual Seed Wt.) in milligrams per kernel 

(mg/kernel); Harvest Index (Harvest Index); total seed weight (Total Seed Wt.) in grams per 
plant (g/plant); number of grain heads per plant (Heads); total plant weight (Plant Wt) in 
grams per plant (g/plant); and flag leaf weight (Flag Leaf WL) in grams per plant (g/plant). 
Seeds were uniformly dried in a 37 °C incubator to a moisture of between about 10% 
10 to about 14%. 

The above ground parts of the plants were harvested at time of maturity and 
uniformly dried to about 0% moisture in a 125 °C incubator. The dried plant weights and 
dried flag leaf weights were adjusted to reflect weights at the same moisture content as that 
of the seeds (i.e., about 10% to about 14%). Roots were not collected. 
15 Plant weight represents the total weight of the "above ground" plant parts not 

including the total seed weight of the plant and the flag leaf weight of the plant. 
Harvest Index (HI) was calculated as follows: 

HI = {(Total Seed Wt.)/(Total Seed Wt. + Plant Wt. + Flag Leaf Wt.)}. 
For the number of wheat heads per plant, the number of heads were counted without 
20 regard to whether or how many seeds were in any particular head. 

The phenotypic data were analyzed in several different ways, as discussed below. 
Comparison Between PCR+ and PCR- Lines . This comparison was made for all 
transgenic lines (15 lines) with positive PCR results (PCR+) for Sh2-Rev6-HS versus all 
transgenic lines (12 lines) with negative PCR results (PCR-) for Sh2-Rev6-HS. Thus, the 
25 PCR+ lines carry both the herbicide resistance gene and the Sh2-Rev6-HS gene while the 

PCR- lines only carry the herbicide resistance genes. The results are presented in Table I. 
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Table I. Comparison Between PCR+ and PCR- Lines 






Number 
and Types 
of Lines 




Seeds/ 
Plants 


Individual 
Seed Wt. 


Total 
Seed Wt. 


Harvest 
Index 


Heads 


Plant Wt. 


Flag 
Leaf 
Wt. 


#of 
Plants 








(mg/kemel) 


(g/plant) 




# 


(g/plant) 


(g/plant) 




15 PCR + 


Avg 
Std 


63.68** 
10.59 


25.2*** 
1.80 


1.73** 
0.34 


0.30** 
0.08 


5.71 
0.86 


3.74 
2.38 


0.40 
0.20 


183 


12 PCR - 


Avg 
Std 


53.50 
9.6 


24.10 
1.50 


1.37 
0.26 


0.25 
0.10 


5.82 
0.81 


3.64 
1.74 


0.41 
0.18 


148 


PCT+/PCR- 




1.19* 


1.05*** 


1.26** 


1.17** 


0.98 


1.03 


0.97 





Indicate p values of less than or equal to: 0.05, 0.01, or 0.001 , respectively, based on a t test. 



10 

Comparison Between SH2+ and SH2- . In the second comparison, only the 8 
transgenic lines with positive PCR results for Sh2-Rev6-HS which also displayed increases 
in the levels of the introduced protein (SH2+) were averaged and compared with all other 
lines (SH2-). The 8 PCR+ lines which are SH2+ are the lines for which increased levels of 
1 5 the introduced protein were detected. Basically, the SH2 levels were compared to those of 
the lines which were transgenic for only the heibicide resistance gene. Those experimental 
plants which produced 25% or more of the SH2 protein as compared to the production of 
SH2 by the lines which were transgenic for only the herbicide resistance gene were 
designated as "SH2+". 

20 The SH2+ lines were compared to the other 19 lines ("SH2-") which lack any 

significant expression of the introduced protein. Thus, the 1 9 SH2- lines includes the 7 PCR+ 
lines which did not express significant levels of the SH2-REV6-HS protein and the 12 PCR- 
lines which did not express SH2-REV6-HS at all. The data is presented in Table II. 



Table II. Comparison Between SH2+ and SH- 


Number 




Seeds/ 


Individual 


Total 


Harvest 


Heads 


Plant Wt. 


Flag Leaf 


#of 


and Types 




Plants 


SeedWt. 


Seed 


Index 






Wt 


Plants 


of Lines 








Wt. 


















(mg/kemel) 


(g/plant) 






(g/plant) 


(g/plant) 




8 SH2+ 


Avg 


78.23*** 


26.9*** 


2.19*** 


0.32*** 


6.00 


4.29*" 


0.45** 


100' 




Std 


45.50 


4.10 


1.35 


0.10 


2.93 


2.29 


0.21 




19 SH2- 


Avg 


50.86 


23.80 


1.30 


0.25 


5.66 


3.44 


0.39 


231 




Std 


33.65 


4.70 


0.94 


0.13 


Z38 


1.99 


0.18 




SH2+/SH2- 




1.54*** 


1.13*** 


1.68*** 


1.25*** 


1.06 


1.25*** 


1.15** 





indicate p values of less than or equal to: 0.05, 0.01 , or 0.001 , respectively, based on a wests. 
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The data presented in Table II show that the total number of seeds per plant for the 
SH2+ lines increased about 54% in comparison to the total number of seeds per plant for the 
SH2- lines. Individual seed weight increased about 13% and the total seed yield increased 
about 68% for the SH2+ lines when compared to the SH2- lines. Harvest Index for the SH+ 
lines was about 25% greater than that for the SH2- lines. The SH2+ lines were also 
significantly bigger in total plant mass and in flag leaf weight (about +25% and about + 1 5%, 
respectively). 

Comparison Between Lines Homozygo us and Heterozygous for Sh2-Rev6-HS. Tl 
plants found to be homozygous by progeny testing the T2 seeds were designated as t£ Homoz 
SH2+" The seeds of Homoz SH2+ plants are expected to have a greater dosage of the 
transgene than the other lines. In this comparison, the Homoz SH2 plants were compared to 
SH2+ plants that were heterozygous (Heteroz SH2+) and were also compared to the 12 PCR- 
lines. 



15 



20 



25 



30 



Table III. Comparison Between Lines Horn 
and Heterozygous for Sh2-Rev6~HS 


ozygous 

r 




Types of 
Lines 




Seeds/ 
Plants 


Individual 
Seed Wt. 


Total 
Seed 
Wt 


Harvest 
Index 


Heads 


Plant 
Wt. 


Flag Leaf 

Wt. 


#of 
Plants 








(mg/kemel) 


(g/piant) 






(g/plant) 


(g/plant) 




Homoz 
SH2+ 


Avg 
Std 


97.1* 
43.60 


27.60 
2.60 


2.74* 
1.31 


0.30 
0.03 


7.59** 
3.10 


5.76 
2.25 


0.56 
0.17 


22 


Heteroz 
SH2+ 


Avg 
Std 


74.50 
45.10 


26.80 
3.80 


2.10 
1.34 


0.33 
0.10 


5.58 
2.54 


3.91 
2.08 


0.42 
0.20 


66 


SH2+ 
Homoz/ 
Heteroz 




1.30* 


1.03 


1.30* 


-0.05 


1.36** 


1.47 


1.33 


22/66 


SH2 
Homoz/ 
12 PCR- 




1.91*** 


1.16*** 


2.10*** 


1.16* 


1.34** 


1.67*** 


1.43*** 


22/148 



The majority (approximately two-thirds) of the plants analyzed were determined to 
be heterozygous for Sh2-Rev6-HS and, therefore, only have half of the possible dosage of the 
transgene coding for SH2-REV6-HS. 
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To determine the effect of increased gene dosage, individual T, plants were 
determined to be homozygous or heterozygous by progeny testing the T 2 seeds harvested 
from the plants. Lack of segregation for the herbicide resistance marker gene was taken as 
evidence of homozygosity. A comparison of 22 SH2+ homozygous plants with heterozygous 
5 SH+ plants indicates that increasing the dosage of the Sh2-Rev6-HS transgene leads to even 

larger yield and plant growth increases over plants which do not contain or express the 
transgene. The results provided in Table EI indicate an approximately 110% increase in total 
seed weight per plant compared to SH2+ heterozygous plants. 

10 Example 3— Experiments with Rice 

The transgenic rice plants are produced as described in the Materials and Methods. 
The resultant rice plants are analyzed as set forth in Examples 1 and 2. 

Example 4— Ex periments with Pea 
1 5 The transgenic pea plants are produced as described in the Materials and Methods. 

The resultant pea plants are analyzed as set forth in Examples 1 and 2. 

Example 5— Northern Analysis of SH2-REV6-HS Transgenic Rice Lines 

Ten (10) or more developing seeds were harvested from individual TO transgenic rice 
20 lines. All TO transgenic lines were PCR positive for the Sh2-Rev6-HS ftznsgene. KNA was 
prepared and analyzed according to standard techniques. Duplicate blots were probed with 
a small AGP subunit probe (Brittle-2) or the Sh2-Rev6-HS transgene coding sequences. The 
genotype labeled M202 is a varietal control. 

As can be seen in Figure 1, RSI, RS4, RS10, RS20, and RS22 transgenic plants 
25 express the Sh2-Rev6-HS transgene, in contrast to untransformed M202 plant which does not 
express the transgene. Due to small differences in loading, minor differences in expression 
. may or may not be due to the transgene. Significant differences in loading are not apparent 
in a duplicate blot probed with the Brittle-2 gene. 



30 
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F^m ple fr-AGP ^tivitv and Tl Sred Weight of SH2-REV6-HS Transgenic LilKS 



Plant AGP Activity Tl Seed Weight 

M202 100 ' Comparable data for M202 not available 

RSI 122 27.0 mg 

RS2 127 27.5 mg 

RS3 121 23.8 mg 

RS4 120 27.1 mg 

RS6 124 25.6 mg 

RS8 122 21.9 mg 

RS10 147 27.7 mg 

RS17 106 20.0 mg 

RS20 114 23.8 mg 

RS21 100 22.7 mg 

RS22 127 21.5 mg 



AGP activity assays reflect a mean of three replicates performed using an extract 
prepared from a minimum of 10 developing seeds. Activities are expressed relative to the 
average value obtained for varietal control plant M202. Tl seed weights are averages of a 
random subsample of mature Tl seeds harvested from individual TO transgenic lines. 

At the AGP activity level, the majority of the Sh2-Rev6-HS transgenic rice lines have 
significant increases relative to M202. Lines RS17 and RS21 do not have significant 
increases in AGP activity. Line RS10 exhibits the highest level of overexpression of all lines 
at the RNA level and also has highest extractable AGP activity. 

Exam ple 7— RSI Tl Grow th Chamber Yield Study 

Sixteen Tl plants (numbered 1, 3, 4, 5, 6, 7, 10, 13, 15, 17, 18, 19, 20, 22, 23, and 25, 
respectively) representing Sh2-Rev6-HS transgenic rice line RSI were grown in a growth 
chamber and compared with five M202 and 5 of control transgenic line 97-3 (the 97-3 line 
carries only hygromycin resistance). The sixteen RSI Tl plants and the 5 97-3 plants came 
from individual seeds germinated on petri plates using hygromycin selection and were then 
transplanted into soil. The 97-3 plants are homozygous for a hygromycin resistance gene 
locus and the RSI Tl plants are heterozygous (12 of 16) or homozygous (4 of 16) for the 
hy&omycm/Sh2-Rev6-HS transgene locus. The dosage of each RSI Tl plant was detemiined 
by progeny tests. RS 1 plants 10, 18, 19, and 20 are homozygous. Difficulty in establishing 
the M202 plants may be a consequence of their being direct seeded into soil. The results are 
shown in Table IV below. 
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10 



15 



20 



25 



30 



35 



Table IV. 




Genotype 


Panicle # 


Total 


Total Seed 


SeedWt./ 


Seeds/ 


Individual 


Harvest 




Seed# 


Wt. 


Panicle 


Panicle 


Seed Wt. 


Index 


H/T?ft? 1 
IVlXUi 1 




185 


9 94 


0.62 


24.06 


0.0258 


0.31 


L 


19 


409 


1 1 04 


0.92 


34.08 


0.0270 


0.39 


\ 

J 


1 5 




17 11 


1.14 


43.13 


0.0264 


0.38 


A 
*• 


14 


41 1 


10 46 


0.75 


29.50 


0.0253 


0.39 




94 


7^1 


70 07 


0 84 


31 29 


0.0267 


0.35 


avg 


IK 7 


^71 


11 774 
XJ. /x*t 


U.OJ 


12 41 


0.0263 


036 


std 


4.12 


14932 


4.10 


0.17 


6.28 


0.0006 


0.03 


Q7 1 1 


11 




90 01 


0 65 


25.45 


0.0254 


0.35 




19 


1040 


97 


0.84 


32.50 


00260 


0.38 


j 


98 


688 
uoo 


17 85 


0.64 


24.57 


0.0259 


0.32 


A 
*# 


1 8 


693 


16 08 


0.89 


34.61 


0.0258 


0.37 


C 




895 


23 57 


0.74 


27.97 


0.0263 


0.37 


avg 


IS 9 


807 


20.902 


0.75 


29.02 


0.0259 


0.36 






148 S8 


3 94 


0.10 


3.92 


0.0003 


0.02 


RSI 1 


33 


943 


24.35 


0.74 


28.58 


0.0258 


0.34 


3 


18 


686 


17.48 


0.97 


38.11 


0.0255 


0.30 


4 


26 


932 


25.91 


1.00 


35.85 


0.0278 


0.36 


5 


14 


540 


14.51 


1.04 


38.57 


0.0269 


0.34 


6 


33 


891 


24.76 


0.75 


27.00 


0.0278 


0.34 


7 


32 


889 


23.97 


0.75 


27.78 


0.0270 


0.36 


10 


20 


587 


15.23 


0.76 


29.35 


0.0259 


0.31 


13 


12 


562 


14.63 


1.22 


46.83 


0.0260 


0.32 


15 


20 


585 


15.14 


0.76 


29.25 


0.0259 


0.32 


17 


17 


678 


17.84 


1.05 


39.88 


0.0263 


0.33 




9 


538 


13.94 


1.55 


59.78 


0.0259 


0.40 


19 


23 


609 


16.05 


0.70 


26.48 


0.0264 


0.33 


20 


19 


594 


15.96 


0.84 


31.26 


0.0269 


0.31 


22 


11 


553 


14.25 


1.30 


50.27 


.0.0258 


■ 0.39 


23 


18 


559 


14.77 


0.82 


31.06 


0.0264 


0.31 


25 


19 


708 


19.27 


1.01 


37.26 


0.0272 


0.36 


avg 


20.25 


67838 


18.00 


0.95 


36.08 


0.0265 


034 


std 


7.28 


144.96 


4.14 


0.23 


9.17 


0.0007 


0.03 


RSI Compared to M202 


1.25 


130 


131 


1.12 


1.11 


1.01 


0.94 


RSI Compared to 97-3 




1 0.72 


0.84 


0.86 


1.27 


1.24 


1.02 


0.94 
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While this initial study of RSI indicates variability between and within genotypes, 
some observations may be valid. First, that RSI Tl plants averaged greater seed weight per 
parade than either control genotype. Second, that RSI TPs averaged a greater number of 
seeds per panicle than either control genotype. This component of yield, seed number per 
panicle, is the largest positively affected parameter in the wheat transformation experiments 
that have been performed using Sh2-Rev6-HS. 

The foregoing detailed description has been given for clearness of understanding only 
and no unnecessary limitations should be understood therefrom as modifications will be 
obvious to those skilled in the art While the invention has been described in connection with 
specific embodiments thereof, it will be understood that it is capable of further modifications 
and this application is intended to cover any variations, uses, or adaptations of the invention 
Mowing, in general, the principles of the invention and including such departures from the 
present disclosure as come within known or customary practice within the art to which the 
invention pertains and as may be applied to the essential features hereinbefore set forth and 
as follows in the scope of the appended claims. 
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Claims 

I claim: 

1 1 . A method of increasing the number of seeds produced by a plant, wherein the 

2 method comprises : 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ ID NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

1 2 encoding an SH2HS or an SH2RTS polypeptide; and 

1 3 b. growing the plant produced in step a. 

1 2. A method of increasing the biomass produced by a plant, wherein the method 

2 comprises: 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ ID NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 
1J retains biological activity of SH2-REV6-HS, and a nucleic acid 

12 encoding.an SH2HS polypeptide; and 

1 3 b. growing the plant produced in step a. 



WO 01/64928 



PCT/US01/06622 



54 

1 3. A method of increasing the Harvest Index of a plant, wherein the method 

2 comprises: 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ ID NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

1 2 encoding an SH2HS polypeptide; and 

1 3 b. growing the plant produced in step a. 

1 • 4. The method of claim 1 , 2, or 3 wherein the plant is a monocotyledonous plant. 

1 5. The method of claim 4 wherein the plant is selected from the group consisting 

2 of rice, wheat, barley, oats, soighum, and millet plant. 

1 6. The method of claim 1, 2, or 3 wherein the plant is a dicotyledonous plant. 

1 7. The method of claim 6 wherein the plant is selected from the group consisting 

2 of pea, alfalfa, birdsfoot trefoil, chickpea, chicory, clover, kale, lentil, prairie grass, small 

3 bumet, soybean, and lettuce plant 

1 8. A method of increasing the flag leaf weight of a monocotyledonous plant, 

-2 —wherein the method comprises: * — . 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 
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7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ ID NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

1 2 encoding an SH2HS polypeptide; and 

13 b. growing the plant produced in step a. 

1 9. A method of increasing the number of seed heads produced by a 

2 monocotyedonous plant, wherein the method comprises: 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ED NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ ID NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

1 2 encoding an SH2HS polypeptide; and 

13 b. growing the plant produced in step a. 

1 1 0, A method of increasing two or more traits of a dicotyledonous plant, wherein 

2 the traits are selected from the group consisting of the number of seeds, average seed weight, 

3 total seed weight, number of seed heads, Harvest Index and total plant weight, wherein the 

4 method comprises: 

- 5 , a.. introducing into a plant a nucleic acid operably linked to a promoter, , 

6 . - wherein the nucleic acid is selected from the group consisting of a 

7 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

8 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

9 encodes a polypeptide that retains biological activity of SH2-REV6- 
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10 HS, a fragment of SEQ ED NO: 3 encoding a peptide that retains 

1 1 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

12 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

13 retains biological activity of SH2-REV6-HS, and a nucleic acid 

1 4 encoding an SH2HS polypeptide; and 

1 5 b. growing the plant produced in step a. 

1 1 1 . A method of increasing the yield of two or more traits of a monocotyledonous 

2 plant, wherein the traits are selected from the group consisting of the number of seeds, 

3 average seed weight, total seed weight, number of seed heads, flag leaf weight, Harvest Index 

4 and total plant weight, wherein the method comprises: 

5 a. introducing into a plant a nucleic acid operably linked to a promoter, 

6 wherein the nucleic acid is selected from the group consisting of a 

7 nucleic acid comprising SEQ ED NO: 3, a nucleic acid which 

8 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

9 encodes a polypeptide that retains biological activity of SH2-REV6- 

10 HS, a fragment of SEQ ED NO: 3 encoding a peptide that retains 

1 1 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

12 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

13 retains biological activity of SH2-REV6-HS, and a nucleic acid 

1 4 encoding an SH2HS polypeptide; and 

1 5 b. growing the plant produced in step a. 

1 12. The method of claim 1, 2, 3, 8, 9, 10, or 1 1 further comprising crossing the 

2 plant obtained in step b with a second plant and harvesting and growing the seed which is 

3 produced as a result of making the cross. 

1 ...-.-13.. The method of claim 1 , 2, 3, 8, 9, 1 0, or 1 1 further comprising harvesting seed 

2 which is produced by selfing the plant obtained in step b and growing the harvested seed. 
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1 14. The method of claim 8, 9, or 1 1 wherein the plant is selected from the group 

2 consisting of rice, wheat, barley, oats, sorghum, and millet plant. 

1 15. The method of claim 1, 2, 3, 8, 9, 10, or 1 1 wherein the SH2HS polypeptide 

2 is selected from the group consisting of SIGHS 13, SH2HS14, SH2HS16, SH2HS33, 

3 SH2HS39, SH2HS40, and SH2HS47, or a fragment of the SH2HS polypeptide that retains 

4 biological activity of the SH2HS polypeptide. 

1 16. The method of claim 1, 2, 3, 8, 9, 10, or 1 1 wherein the SH2RTS polypeptide 

2 is selected from the group consisting of SH2RTS48-2 and SH2RTS60-1 , or a fragment of the 

3 SH2RTS polypeptide that retains biological activity of the SH2RTS polypeptide. 

1 17. The method of claim 10 wherein the plant is selected from the group 

2 consisting of pea, alfalfa, birdsfoot trefoil, chickpea, chicory, clover, kale, lentil, prairie grass, 

3 small bumet, soybean, and lettuce plant. 

1 1 8. A plant produced by the method of claim 1 , 2, 3, 8, 9, 10, or 1 1 . 



1 19. A plant comprising a nucleic acid encoding the amino acid sequence as set 

2 forth in SEQ ID NO: 4. 
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SEQUENCE LISTING 

<110> Giroux, Michael 

<120> Transgenic Plants with Increased Seed Yield, Biomass and Harvest Index 
<130> RDI-100 

<140> US 09/516,250 
<141> 2000-03-01 

<160> 6 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 1582 

<212> DNA 

<213> Zea mays 

<220> 

<221> CDS 

<222> (10) . . (1563) 

<223> Shrunken-2 gene revertant form 
<220> 

<221> variation 
<222> (267) 

<223> k = g or t; amino acid 86 « Ala. 
<220> 

<221> variation 
<222> (1368) 

<223> r = a or g; amino acid 453 - Pro. 
<220> 

<221> variation 
<222>'(1578) 
<223> k = g or t. 

<400> 1 

ggaggagat atg cag ttt gca ctt gca ttg gac acg aac tea ggt cct cac 51 

Met Gin Phe Ala Leu Ala Leu Asp Thr Asn Ser Gly Pro His 
1 5 10 

cag ata aga tct tgt gag ggt gat ggg att gac agg ttg gaa aaa tta 99 
Gin He Arg Ser Cys Glu Gly Asp Gly He Asp Arg Leu Glu Lys Leu 
15 " 20 ' ~ 25 30 

agt att ggg ggc aga aag cag gag aaa get ttg aga aat agg tgc ttt 147 
Ser He Gly Gly Arg Lys Gin Glu Lys Ala Leu Arg Asn Arg Cys Phe 
35 40 45 

ggt ggt aga gtt get gca act aca caa tgt att ctt acc tea gat get 195 
Gly Gly Arg Val Ala Ala Thr Thr Gin Cys He Leu Thr Ser Asp Ala 
50 55 60 

tgt cct gaa act ctt cat tct caa aca cag tec tct agg aaa aat tat 243 
Cys Pro Glu Thr Leu His Ser Gin Thr Gin Ser Ser Arg Lys Asn Tyr 
65 70 75 
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get gat gca aac cgt gta tct gck ate att ttg ggc gga ggc act gga 291 

Ala Asp Ala Asn Arg Val Ser Xaa He He Leu Gly Gly Gly Thr Gly 
80 85 90 

tct cag etc ttt cct ctg aca age aca aga get acg cct get gta cct 

Ser Gin Leu Phe Pro Leu Thr Ser Thr Arg Ala Thr Pro Ala Val Pro 
95 100 105 110 

gtt gga gga tgt tac agg ctt att gat ate cct atg agt aac tgc ttc 387 

Val Gly Gly Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys Phe 

115 " 120 125 

aac agt ggt ata aat aag ata ttt gtg atg agt cag ttc aat tct act 435 

Asn Ser Gly He Asn Lys He Phe Val Met Ser Gin Phe Asn Ser Thr 

130 135 140 

teg ctt aac cgc cat att cat cgt aca tac ctt gaa ggc ggg ate aac 483 

Ser Leu Asn Arg His He His Arg Thr Tyr Leu Glu Gly Gly He Asn 
145 150 155 

ttt get gat gga tct gta cag gta tta gcg get aca caa atg cct gaa 531 

Phe Ala Asp Gly Ser Val Gin Val Leu Ala Ala Thr Gin Met Pro Glu 
160 "* 165 170 



gag cca get gga tgg ttc cag ggt aca gca gac tct ate aga aaa ttt 57 9 

Glu Pro Ala Gly Trp Phe Gin Gly Thr Ala Asp Ser He Arg Lys Phe 
175 " 180 185 190 

ate tgg gta etc gag gat tat tac agt cac aaa tec att gac aac att 627 

He Trp Val Leu Glu Asp Tyr Tyr Ser His Lys Ser He Asp Asn lie 

195 200 205 

gta ate ttg agt ggc gat cag ctt tat egg atg aat tac atg gaa ctt 675 

Val lie Leu Ser Gly Aso Gin Leu Tyr Arg Met Asn Tyr Met Glu Leu 

210 " 215 220 

gtg cag aaa cat gtc gag gac gat get gat ate act ata tea tgt get 723 

Val Gin Lys His Val Glu Asp Asp Ala Asp lie Thr lie Ser Cys Ala 

225 230 235 

cct gtt gat gag age cga get tct aaa aat ggg eta gtg aag att gat 771 

Pro Val Asp Glu Ser Arg Ala Ser Lys Asn Gly Leu Val Lys He Asp 
240 245 250 

cat act gga cgt gta ctt caa ttc ttt gaa aaa cca aag ggt get gat 819 

His Thr Gly Arg Val Leu Gin Phe Phe Glu Lys. Pro Lys Gly Ala Asp 
.255 * 260 . 265 270 

ttg aat tct atg aga gtt gag ace aac ttc ctg age tat get ata gat 867 

Leu Asn Ser Met Arg Val Glu Thr Asn Phe Leu Ser Tyr Ala He. Asp 

275 280 285 

gat gca cag aaa tat cca tac ctt gca tea atg ggc att -tat gtc -ttc 915 

Asp Ala Gin Lys Tyr Pro Tyr Leu Ala Ser Met Gly lie Tyr Val Phe 

290 295 300 

aag aaa gat gca ctt tta gac ctt etc aag tea aaa tat act caa tta 963 

Lys Lys Asp Ala Leu Leu Asp Leu Leu Lys Ser Lys Tyr Thr Gin Leu 

305 310 315 
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cat aac ttt qga tct gaa ate etc cca aga get gta eta gat cat agt 1011 
His Asp Phe Gly Ser Glu He Leu Pro Arg Ala Val Leu Asp His Ser 
320 325 330 

qtg cag gca tgc att ttt acg ggc tat tgg gag gat gtt gga aca ate 
Va! Gin Ala Cys lie Phe Thr Gly Tyr Trp Glu Asp Val Gly Thr lie 
335 * 340 345 350 

aaa tea ttc ttt gat gca aac ttg gee etc act gag cag cct tec aag 
Lys Ser Phe Phe Asp Ala Asn Leu Ala Leu Thr Glu Gin Pro Ser Lys 
J 355 360 365 

ttt oat ttt tac gat cca aaa aca cct ttc ttc act gca ccc cga tgc 
Phe Asp Phe Tyr Asp Pro Lys Thr Pro Phe Phe Thr Ala Pro Arg Cys 
370 375 380 

ttg cct ccg acg caa ttg gac aag tgc aag atg aaa tat gca ttt ate 
Leu Pro Pro Thr Gin Leu Asp Lys Cys Lys Met Lys Tyr Ala Phe He 
385 390 395 

tea gat ggt tgc tta ctg aga gaa tgc aac ate gag cat tct gtg att 
Ser Asp Gly Cys Leu Leu Arg Glu Cys Asn lie Glu Has Ser Val He 
400 405 410 

gga gtc tgc tea cgt gtc age tct gga tgt gaa etc aag gac tec gtg 
Gly Val Cys Ser Arg Val Ser Ser Gly Cys Glu Leu Lys Asp Ser Val 
415 420 425 430 

atg atg gga gcg gac ate tat gaa act gaa gaa gaa get tea aag eta 1347 
Met Met Gly Ala Asp lie Tyr Glu Thr Glu Glu Glu Ala Ser Lys Leu 
435 440 445 

ctg tta get ggg aag gtc ccr gtt gga ata gga agg aac aca aag ata 1395 
Leu Leu Ala Gly Lys Val Xaa Val Gly lie Gly Arg Asn Thr Lys He 
450 455 460 

aqq aac tgt ate att gac atg aat get agg att ggg aag aac gtg gtg 1443 
Arg Asn Cys He He Asp Met Asn Ala Arg He Gly Lys Asn Val Val 



465 



470 475 



1059 



1107 



1155 



1203 



1251 



1299 



1491 



1539 



ate aca aac agt aag ggc ate caa gag get gat cac ccg gaa gaa ggg 

lie Thr Asn Ser Lys Gly lie Gin Glu Ala Asp His Pro Glu Glu Gly 

480 ' 485 490 

tac teg tac tac ata agg tct gga ate gtg gtg ate ctg aag aat gca 

Tyr Ser Tyr Tyr He Arg Ser Gly lie Val Val lie Leu Lys Asn Ala 
495 * 500 505 510 

ace ate aac gat ggg tct gtc ata tagategget gcgtktgcg 1582 

Thr lie Asn Asp Gly Ser Val He 
515 



<210> 2 
<211> 518 
<212> PRT 
<213> Zea mays 

<220> 
<221> - 
*c22 2 ^(86) 

<223> Xaa = Ala, from degeneracy at position 267 in DNA sequence (k - g or t) 



WO 01/64928 



PCT/US01/06622 



<220> 
<221> - 
<222> (453) 

<223> Xaa = Pro, from degeneracy at position 1368 in DNA sequence (r - a or 

g) • 

<400> 2 

Met Gin Phe Ala Leu Ala Leu Asp Thr Asn Ser Gly Pro His Gin He 
15 10 15 

Arg Ser Cys Glu Gly Asp Gly He Asp Arg Leu Glu Lys Leu Ser He 
20 25 30 

Gly Gly Arg Lys Gin Glu Lys Ala Leu Arg Asn Arg Cys Phe Gly Gly 
35 40 45 

Arg Val Ala Ala Thr Thr Gin Cys He Leu Thr Ser Asp Ala Cys Pro 
50 55 60 

Glu Thr Leu His Ser Gin Thr Gin Ser Ser Arg Lys Asn Tyr Ala Asp 
65 70 75 80 

Ala Asn Arg Val Ser Xaa He He Leu Gly Gly Gly Thr Gly Ser Gin 
85 90 95 

Leu Phe Pro Leu Thr Ser Thr Arg Ala Thr Pro Ala Val Pro Val Gly 
100 105 110 

Glv Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys Phe Asn Ser 
115 120 125 

Gly He Asn Lys He Phe Val Met Ser Gin Phe Asn Ser Thr Ser Leu 
130 135 140 

Asn Arg His He His Arg Thr Tyr Leu Glu Gly Gly He Asn Phe Ala 
145 150 155 160 

Asp Gly Ser Val Gin Val Leu Ala Ala Thr Gin Met Pro Glu Glu Pro 
165 170 175 

Ala Gly Trp Phe Gin Gly Thr Ala Asp Ser He Arg Lys Phe lie Trp 
180 185 190 

Val Leu Glu Asp Tyr Tyr Ser His Lys Ser He Asp Asn He Val He 
195 " 200 205 

Leu Ser Gly Asp Gin Leu Tyr Arg Met Asn Tyr Met Glu Leu Val Gin 
210 215 220 

.. Lys His Val Glu Asp Asp Ala Asp lie Thr He Ser Cys Ala Pro Val 
225 230 • 235 240 

-Asp- Glu -Ser -Arg--Al-a-Ser-Lys-Asn-Gly Leu Val Lys- He Asp His Thr - - - 

245 250 255 

Gly Arg Val Leu Gin Phe Phe Glu Lys Pro Lys Gly Ala Asp Leu Asn 
260 265 270 

Ser Met Arg Val Glu Thr Asn Phe Leu Ser Tyr Ala He Asp Asp Ala 
275 280 285 
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Gin Lys Tyr Pro Tyr Leu Ala Ser Met Gly He Tyr Val Phe Lys Lys 
290 295 300 

Asp Ala Leu Leu Asp Leu Leu Lys Ser Lys Tyr Thr Gin Leu His Asp 
305 310 il: > 

Phe Gly Ser Glu lie Leu Pro Arg Ala Val Leu Asp His Ser Val Gin 
325 330 JJ0 

Ala Cys lie Phe Thr Gly Tyr Trp Glu Asp Val Gly Thr lie Lys Ser 

Phe Phe Asp Ala Asn Leu Ala Leu Thr Glu Gin Pro Ser Lys Phe Asp 
355 360 365 

Phe Tyr Asp Pro Lys Thr Pro Phe Phe Thr Ala Pro Arg Cys Leu Pro 
370 375 380 

Pro Thr Gin Leu Asp Lys Cys Lys Met Lys Tyr Ala Phe lie Ser Asp 
385 390 395 

Gly Cys Leu Leu Arg Glu Cys Asn lie Glu His Ser Val He Gly Val 

Cys Ser Arg Val Ser Ser Gly Cys Glu Leu Lys Asp Ser Val Met Met 
420 425 430 

Gly Ala Asp lie Tyr Glu Thr Glu Glu Glu Ala Ser Lys Leu Leu Leu 
435 440 44b 

Ala Gly Lys Val Xaa Val Gly lie Gly Arg Asn Thr Lys lie Arg Asn 
450 455 460 

Cys lie lie Asp Met Asn Ala Arg He Gly Lys Asn Val Val lie Thr 
465 470 475 480 

Asn Ser Lys Gly He Gin Glu Ala Asp His Pro Glu Glu Gly Tyr Ser 
485 490 49b 

Tyr Tyr lie Arg Ser Gly He Val Val lie Leu Lys Asn Ala Thr lie 
500 505 510 

Asn Asp Gly Ser Val He 
515 



<210> 3 
<211> 1582 
<212> DNA 
<213> Zea mays 



<220> 

-<221>- CDS - * ■ 

<222> (10) . . (1563) 

<223> Shrunken-2 gene revertant form, modified to De 
heat stable 

<220> 

<221> variation 
<222> (267) 

<223> k = g or t; amino acid 86 = Ala. 
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<220> 

<221> variation 
<222> (1008) 
<223> y = c or t. 



<220> 

<221> variation 
<222> (1368) 

<223> r = a or g; amino acid 453 = Pro. 
<220> 

<221> variation 
<222> (1578) 
<223> k = g or t. 



<400> 3 

ggaggagat atg cag ttt gca ctt gca ttg gac acg aac tea ggt cct cac 51 
Met Gin Phe Ala Leu Ala Leu Asp Thr Asn Ser Gly Pro His 
15 10 

cag ata aga tct tgt gag ggt gat ggg att gac agg ttg gaa aaa tta 99 
Gin He Arg Ser Cys Glu Gly Asp Gly lie Asp Arg Leu Glu Lys Leu 
15 20 25 30 

agt att ggg ggc aga aag cag gag aaa get ttg aga aat agg tgc ttt 147 
Ser He Gly Gly Arg Lys Gin Glu Lys Ala Leu Arg Asn Arg Cys Phe 
35 40 45 



ggt ggt. aga gtt get gca act aca caa tgt att ctt ace tea gat get 
Gly Gly Arg Val Ala Ala Thr Thr Gin Cys He Leu Thr Ser Asp Ala 
50 55 60 



195 



tgt cct gaa act ctt cat tct caa aca cag tec tct agg aaa aat tat 243 

Cys Pro Glu Thr Leu His Ser Gin Thr Gin Ser Ser Arg Lys Asn Tyr 

65 70 75 

get gat gca aac cgt gta tct gck ate att ttg ggc gga ggc act gga 2 91 

Ala Asp Ala Asn Arg Val Ser Xaa He He Leu Gly Gly Gly Thr Gly 

80 85 90 

tct cag etc ttt cct ctg aca age aca aga get acg cct get gta cct 339 

Ser Gin Leu Phe Pro Leu Thr Ser Thr Arg Ala Thr Pro Ala Val Pro 

95 100 105 110 

gtt gga gga tgt tac agg ctt att gat ate cct atg agt aac tgc ttc 387 

Val Gly Gly Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys Phe 

115 120 125 

aac agt ggt ata aat aag ata ttt-gtg atg agt cag tte aat tct act 435 

Asn Ser Gly lie Asn Lys He Phe Val Met Ser Gin Phe Asn Ser Thr 

130 J 135 140 

'.teg ctt aac cgc cat att cat cgt aca. tac ctt gaa ggc. ggg. ate .aac . .483 

Ser Leu Asn Arg His lie His Arg Thr Tyr Leu Glu Gly Gly lie Asn 

145 150 155 , . . 

ttt get gat gga tct gta cag gta tta gcg get aca caa atg cct gaa 531 

Phe Ala Asp Gly Ser Val Gin Val Leu Ala Ala Thr Gin Met Pro Glu 

160 165 170 
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gag cca get gga tgg ttc cag ggt aca gca gac tct ate aga aaa ttt 57 9 
Glu Pro Ala Gly Trp Phe Gin Gly Thr Ala Asp Ser He Arg Lys Phe 
175 180 185 190 



ate tgg gta etc gag gat tat tac agt cac aaa tec att gac aac att 
He Trp Val Leu Glu Asp Tyr Tyr Ser His Lys Ser He Asp Asn He 
195 200 205 



627 



gta ate ttg agt ggc gat cag ctt tat egg atg aat tac atg gaa ctt 675 
Val He Leu Ser Gly Asp Gin Leu Tyr Arg Met Asn Tyr Met Glu Leu 
210 215 220 

gtg cag aaa cat gtc gag gac gat get gat ate act ata tea tgt get 723 
Val Gin Lys His Val Glu Asp Asp Ala Asp He Thr He Ser Cys Ala 
225 230 235 

cct gtt gat gag age cga get tct aaa aat ggg eta gtg aag att gat 771 
Pro Val Asp Glu Ser Arg Ala Ser Lys Asn Gly Leu Val Lys He Asp 
240 245 250 

cat act gga cgt gta ctt caa ttc ttt gaa aaa cca aag ggt get gat 
His Thr Gly Arg Val Leu Gin Phe Phe Glu Lys Pro Lys Gly Ala Asp 
255 260 265 270 

ttg aat tct atg aga gtt gag acc aac ttc ctg age tat get ata gat 
Leu Asn Ser Met Arg Val Glu Thr Asn Phe Leu Ser Tyr Ala He Asp 
275 280 285 

gat gca cag aaa tat cca tac ctt gca tea atg ggc att tat gtc ttc 
Asp Ala Gin Lys Tyr Pro Tyr Leu Ala Ser Met Gly He Tyr Val Phe 
290 295 300 

aag aaa gat gca ctt tta gac ctt etc aag tea aaa tat act caa tta 
Lys Lys Asp Ala Leu Leu Asp Leu Leu Lys Ser Lys Tyr Thr Gin Leu 
305 310 315 

cat gac ttt gga tct gaa ate etc cca aga get gta eta gat tay agt 
His Asp Phe Gly Ser Glu He Leu Pro Arg Ala Val Leu Asp Tyr Ser 
320 325 330 

gtg cag gca tgc att ttt acg ggc tat tgg gag gat gtt gga aca ate 
Val Gin Ala Cys He Phe Thr Gly Tyr Trp Glu Asp Val Gly Thr He 
335 " 340 345 350 

aaa tea ttc ttt gat gca aac ttg gee etc act gag cag cct tec aag 
Lys Ser Phe Phe Asp Ala Asn Leu Ala Leu Thr Glu Gin Pro Ser Lys 
355 360 365 

ttt gat ttt tac gat cca aaa aca cct ttc ttc act gca ccc cga tgc 
Phe Asp Phe Tyr Asp Pro Lvs Thr Pro Phe Phe Thr Ala Pro Arg Cys 
370 375 380 

ttg cct ccg acg caa ttg gac aag tgc aag atg aaa tat gca ttt ate. 
Leu Pro Pro Thr Gin Leu Asp Lys Cys Lys Met Lys Tyr Ala Phe lie 
385 390 395 



819 



867 



915 



963 



1011 



1059 



1107 



1155 



1203 



tea gat ggt tgc tta ctg aga gaa tgc aac ate gag cat tct gtg att 1251 
Ser Asp Gly Cys Leu Leu Arg Glu Cys Asn lie Glu His Ser Val lie 
400 405 410 
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gga gtc tgc tea cgt gtc age tct gga tgt gaa etc aag gac tec gtg 1299 

Gly Val Cys Ser Arg Val Ser Ser Gly Cys Glu Leu Lys Asp Ser Val 

415 "* 420 425 430 

atg atg gga gcg gac ate tat gaa act gaa gaa gaa get tea aag eta 1347 

Met Met Gly Ala Asp He Tyr Glu Thr Glu Glu Glu Ala Ser Lys Leu 

435 440 445 

ctg tta get ggg aag gtc ccr gtt gga ata gga agg aac aca aag ata 1395 

Leu Leu Ala Gly Lys Val Xaa Val Gly He Gly Arg Asn Thr Lys He 

450 " 455 460 

agg aac tgt ate att gac atg aat get agg att ggg aag aac gtg gtg 1443 

Arg Asn Cys He He Asp Met Asn Ala Arg He Gly Lys Asn Val Val 

465 470 475 

ate aca aac agt aag ggc ate caa gag get gat cac ccg gaa gaa ggg 1491 

He Thr Asn Ser Lys Gly lie Gin Glu Ala Asp His Pro Glu Glu Gly 

480 485 490 

tac teg tac tac ata agg tct gga ate gtg gtg ate ctg aag aat gca 1539 

Tyr Ser Tyr Tyr He Arg Ser Gly lie Val Val lie Leu Lys Asn Ala 

495 * ~ 500 505 510 

acc ate aac gat ggg tct gtc ata tagategget gcgtktgcg 1582 

Thr lie Asn Asp Gly Ser Val lie 
515 



<210> 4 
<211> 518 
<212> PRT 
<213> Zea mays 

<220> 
<221> - 
<222> (86) 

<223> Xaa = Ala, from degeneracy at position 267 in DNA sequence (k = g or t) 

<220> 
<221> - 
<222> (453) 

<223> Xaa = Pro, from degeneracy at position 1368 in DNA sequence (r = a or 

g). 

<400> 4 

Met Gin Phe Ala Leu Ala Leu Asp Thr Asn Ser Gly Pro His Gin He 
1 5 10 15 

Arg Ser Cys Glu Gly Asp Gly He Asp Arg Leu Glu Lys Leu Ser lie 
20 25 30 

Gly- Gly Arg- Lys- Gin Glu Lys Ala Leu Arg Asn Arg Cy> Phe Gly.-GJLy - 

35 40 45 

Arg Val Ala Ala Thr Thr Gin Cys lie Leu Thr Ser Asp Ala Cys Pro 
50 55 60 

Glu Thr Leu His Ser Gin Thr Gin Ser Ser Arg Lys Asn Tyr Ala Asp 
65 70 75 80 
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Ala Asn Arg Val Ser Xaa He He Leu Gly Gly Gly Thr Gly Ser Gin 
" " 85 90 95 

Leu Phe Pro Leu Thr Ser Thr Arg Ala Thr Pro Ala Val Pro Val Gly 
100 105 HO 

Gly Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys Phe Asn Ser 
115 120 125 

Gly He Asn Lys He Phe Val Met Ser Gin Phe Asn Ser Thr Ser Leu 
130 " 135 140 

Asn Arg His He His Arg Thr Tyr Leu Glu Gly Gly He Asn Phe Ala 
145 150 155 160 

Asp Gly Ser Val Gin Val Leu Ala Ala Thr Gin Met Pro Glu Glu Pro 
165 170 175 

Ala Gly Trp Phe Gin Gly Thr Ala Asp Ser lie Arg Lys Phe He Trp 
180 185 190 

Val Leu Glu Asp Tyr Tyr Ser His Lys Ser He Asp Asn He Val He 
195 200 205 

Leu Ser Gly Asp Gin Leu Tyr Arg Met Asn Tyr Met Glu Leu Val Gin 
210 ' 215 220 

Lys His Val Glu Asp Asp Ala Asp He Thr He Ser Cys Ala Pro Val 
225 230 235 240 

Asp Glu Ser Arg Ala Ser Lys Asn Gly Leu Val Lys He Asp His Thr 
245 ~" 250 255 

Gly Arg Val Leu Gin Phe Phe Glu Lys Pro Lys Gly Ala Asp Leu Asn 
260 265 270 

Ser Met Arg Val Glu Thr Asn Phe Leu Ser Tyr Ala lie Asp Asp Ala 
275 280 285 

Gin Lys Tyr Pro Tyr Leu Ala Ser Met Gly He Tyr Val Phe Lys Lys 
290 295 300 

Asp Ala Leu Leu Asp Leu Leu Lys Ser Lys Tyr Thr Gin Leu His Asp 
305 310 315 320 

Phe Gly Ser Glu He Leu Pro Arg Ala Val Leu Asp Tyr Ser Val Gin 
325 330 335 

Ala Cys He Phe Thr Gly Tyr Trp Glu Asp Val Gly Thr He Lys Ser 
340 345 350 

Phe Phe Asp Ala Asn Leu -Ala Leu Thr Glu Gin Pro Ser Lys Phe Asp 

355 _ 360 ... . . . -365 ... 

Phe Tyr Asp Pro Lys Thr Pro Phe Phe Thr Ala Pro Arg Cys Leu Pro 
370 375 380 

Pro Thr Gin Leu Asp Lys Cys Lys Met Lys Tyr Ala Phe lie Ser Asp 
385 390 395 400 

Gly Cys Leu Leu Arg Glu Cys Asn He Glu His Ser Val lie Gly Val 
405 410 415 
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Cys Ser Arg Val Ser Ser Gly Cys 
420 

Gly Ala Asp He Tyr Glu Thr Glu 

435 440 

Ala Gly Lys Val Xaa Val Gly He 
450 455 

Cys He He Asp Met Asn Ala Arg 
465 470 

Asn Ser Lys Gly He Gin Glu Ala 
485 

Tyr Tyr He Arg Ser Gly He Val 
500 



Asn Asp Gly Ser Val He 
515 



10 



Glu Leu Lys Asp Ser Val Met Met 
425 430 

Glu Glu Ala Ser Lys Leu Leu Leu 
445 

Gly Arg Asn Thr Lys He Arg Asn 
460 

lie Gly Lys Asn Val Val He Thr 
475 480 

Asp His Pro Glu Glu Gly Tyr Ser 
490 495 

Val He Leu Lys Asn Ala Thr He 
505 510 



<210> 5 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 5 

ctggatgtga actcaaggac tccgtg 



<210> 6 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 



<400> 6 

ggcttaacta tgcggcatca gage 



24 
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